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15 Background 

Cell death is a fundamental aspect of animal 
development. Many cells die during the normal develop- 
ment of both vertebrates (Glucksmann, Biol. Rev. 
Cambridge Philos . Soc. 26:59-86 (1951)) and inverte- 
20 brates (Truman, Ann. Rev. Neurosci. 7:171-188 (1984)). 
These deaths appear to function in morphogenesis, 
metamorphosis and tissue homeostasis, as well as in the 
generation of neuronal specificity and sexual 
dimorphism (reviewed by Ellis et al., Ann. Rev. Cell 
25 Biol. 7:663-698 (1991)). An understanding of the 
mechanisms that cause cells to die and that specify 
which cells are to live and which cells are to die is 
essential for an understanding of animal development. 

The nematode Caenorhabditis elegans is an 
appropriate organism for analyzing naturally-occurring 
or programmed cell death (Horvitz et al., Neurosci. 
Comment. 1:56-65 (1982)). The generation of the 959 
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somatic cells of the adult C. elegans hermaphrodite is 
accompanied by the generation and subsequent deaths of 
an additional 131 cells (Sulston and Horvitz, Dev. 
Biol. 82:110-156 (1977); Sulston et al.. Dev. Biol. 
5 100:64-119 (1982)). The morphology of cells undergoing 
programmed cell death in C. elegans has been described 
at both the light and electron microscopic levels 
(Sulston and Horvitz, Dev. Biol. 82:100-156 (1977); 
Robertson and Thomson, J. Embryol. Exp. Morph. 67:89- 

10 100 (1982) ) . 

Many genes that affect C. elegans programmed cell 
death have been identified (reviewed by Ellis et al., 
Ann. Rev. Cell Biol. 7:663-698 (1991)). The activities 
of two of these genes, ced-3 and ced-4, are required 

15 for the onset of almost all C. elegans programmed cell 
deaths (Ellis and Horvitz, Cell 44:817-829 (1986)). 
When the activity of either ced-3 or ced-4 is , 
eliminated, cells that would normally die instead 
survive and can differentiate into recognizable cell 

20 types and even function (Ellis and Horvitz, Cell 

44:817-829 (1986); Avery and Horvitz, Cell 51:1071-1078 
(1987); White et al . , Phil. Trans. R. Soc. Lond. B. 
331:263-271 (1991)). Genetic mosaic analyses have 
indicated th^t the ced-3 and ced-4 genes most likely 

25 act in a cell autonomous manner within dying cells, 
suggesting that the products of these genes are 
expressed within dying cells and either are cytotoxic 
molecules or control the activities of cytotoxic 
molecules (Yuan and Horvitz, Dev. Biol. 138:33-41 

30 (1990)). 

Summary of the Invention 

This invention relates to genes shown to be 
essential for programmed cell death, referred to herein 
as cell death genes, to their encoded products (RNA and 
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polypeptides) , and to antibodies directed against the 
encoded polypeptides. Methods and probes for 
identifying and screening for other cell death genes, 
including those of vertebrates as well as 
invertebrates, and possibly, microbes and plants, are 
described. Agents which mimic or affect the activity 
of cell death genes and methods for identifying these 
agents are also described. Bioassays which detect the 
activity of cell death genes and which are useful for 
identifying cell death genes, for testing the effect of 
mutations in cell death genes, and for identifying 
agents which mimic or affect the activity of cell death 
genes are also provided. This invention further 
relates to methods for altering (increasing or 
decreasing) the activity of the cell death genes or 
their encoded products in cells and, thus, for altering 
the proliferative capacity or longevity of a cell 1 
population or organism. 

Specifically, the ced-3 and ced-4 genes of the 
nematode C. elegans have been identified, sequenced, 
and characterized. These genes have been shown to be 
required for almost all the programmed cell deaths 
which occur during development in C. elegans. Thus, 
two cell death genes and their encoded products (RNA, 
polypeptide) are now available for a variety of uses. 

As described herein, the ced-3 and ced-4 genes can 
be used to identify structurally related genes from a 
variety of sources. Some of these related genes are 
likely to also function as cell death genes. 
Structural comparison of related cell death genes, as 
well as mutational analysis, can provide insights into 
functionally important regions or features of cell 
death genes and gene products. This information is 
useful in the design of agents which mimic or which 
alter the activity of cell death genes. 



This invention further provides methods and agents 
for altering (increasing or decreasing) the occurrence 
of cell death in a cell population or organism. 
Methods and agents , described herein, which decrease 
cell death are potentially useful for treatment 
(therapeutic and preventive) of disorders and 
conditions characterized by cell deaths, including 
myocardial infarction, stroke, traumatic brain injury, 
degenerative diseases (e.g., Huntington's disease, 
amyotrophic lateral sclerosis, Alzheimer's disease, 
Parkinson's disease, and Duchenne's muscular 
dystrophy) , viral and other types of pathogenic 
infection (e.g., human immunodeficiency virus, HIV), 
aging and hair loss. Methods and agents which 
increase cell death are also provided and are 
potentially useful for reducing the proliferation or 
size of cell populations, such as cancerous cells, n 
cells infected with viruses (e.g., HIV) or other 
infectious agents, cells which produce autoreactive 
antibodies and hair follicle cells. Such methods and 
agents may also be used to incapacitate or kill 
undesired organisms, such as pests, parasites, and 
recombinant organisms. 

Brief Description of the Drawings 

Figure 1 shows the genomic organization and 
nucleotide sequence (Seq. ID #1) of ced-4 and deduced 
amino acid sequence (Seq. ID #2). The genomic sequence 
of the ced-4 region was obtained from plasmid C10D8-5, 
which rescues the ced-4 mutant phenotype. Two likely 
transcriptional start sites are marked with downward 
arrows. The start of the cDNA is marked with a solid 
arrowhead. The positions of eight ced-4 mutations are 
indicated by upward arrows. Numbers on the sides 
indicate nucleotide positions, beginning at the start 
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of C10D8-5. Numbers under the amino acid sequence 
indicate codon positions. Vertical lines between 
nucleotides indicate splice junctions. 

Figure 2 shows the genomic structure of the ced-4 
gene and positions of ced-4 mutations. The sizes of 
exons and introns are indicated in base pairs (bp) . 
The downward arrows indicate the positions of the Tc4 
insertion in the ced-4 (nl416) mutant and of eight EMS- 
induced mutations of ced-4. The arrow pointing right 
indicates the direction of transcription. The solid 
arrowhead indicates the translation initiation site. 
The open arrowhead indicates the ochre termination 
codon . 

Figure 3 shows the sequence similarities between 
the Ced-4 protein and some calcium-binding proteins. 
The consensus sequence of the calcium-binding loop is 
shown at the top. The positions indicated by X, Y ( Z, 
-X, and -Z correspond to vertices of an octahedron. 
The numbers above the X, Y, Z, -X and -Z correspond to 
the positions of the residues within the 29 amino acid 
EF-hand sequence. Amino acids are indicated by the 
single letter code. 0, amino acid with an oxygen- 
containing side chain. *, non-conserved amino acid. 
Positions Y/'z and -X can be any amino acid with 
oxygen-containing side chains. Position X is usually 
aspartic acid, and position -Z is usually glutamic 
acid. Conserved amino acids are shown in bold-face. 
Deviations from the EF-hand consensus sequence are 
underlined. 

Figure 4 shows the nucleotide sequence (Seq. ID 
#18) of ced-3 and deduced amino acid sequence (Seq. ID 
#19) . The genomic sequence of the ced-3 region was 
obtained from plasmid pJ107, which rescues the ced-3 
mutant phenotype. The likely translation initiation 
site is indicated by a solid arrowhead. The SL1 splice 
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acceptor of the RNA is boxed. The positions of 12 
ced-3 mutations are indicated. Repetitive elements in 
the introns are indicated as arrows above the relevant 
sequence. Numbers on the sides indicate nucleotide 
5 positions, beginning with the start of pJl07. Numbers 
under the amino acid sequence indicate codon positions. 

Figure 5A shows the genomic structure of the ced-3 
gene and the location of the mutations. The sizes of 
the introns and exons are given in bp. The downward 

10 arrows indicate the positions of 12 EMS-induced 

mutations of ced-3. The arrow pointing right indicates 
the direction of transcription. The solid arrowhead 
indicates the translation initiation site. The open 
arrowhead indicates the termination codon. 

15 Figure 5B shows the locations of the mutations 

relative to the exons (numbered 1-8) and the encoded 
serine-rich region. , 

Figure 6 is a Kyte-Doolittle hydrophobicity plot 
of the Ced-3 protein. 

20 Figure 7 shows a comparison of the Ced-3 proteins 

of C. elegans (line 1) and related nematodes, C. 
Jbriggrsae (line 2) and C. vulgaris (line 3). The 
conserved amino acids are indicated by " . " . Gaps 
inserted in the sequence for the purpose of alignment 

25 are indicated by . 

Figure 8 shows a restriction site map of the ced-4 
region and the relative positions of plasmid C10D8-5, 
plasmid insert pnl416, and three transcripts encoded by 
the region. 

30 Figure 9 shows physical and genetic maps of the 

ced-3 region on chromosome IV. 

Figure 10 summarizes experiments to localize ced-3 
within C48D1. Restriction sites of plasmid C48D1 and 
subclone plasmids are shown, ced-3 activity was scored 

35 as the number of cell corpses in the head of LI young 
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animals. ++, the number of cell corpses above 10. +, 
the number of cell corpses below 10 but above 2. 
the number of cell corpses below 2. 

Detailed Descrip tion of the Invention 

The ced-3 and ced-4 genes of C. elegans have been 
shown to be required for almost all programmed cell 
deaths in C. elegans development (Ellis and Horvitz, 
Cell 44:817-829 (1986)). The present work describes 
the cloning, sequencing and characterization of these 
genes. As a result of this work, two genes whose 
activities are required for cell death, referred to 
herein as cell death genes, and their encoded products 
(RNA, polypeptide) are available for a variety of uses. 
Described below are the cloning and characterization of 
the C. elegans ced-4 and ced-3 genes, methods and 
probes for identifying structurally related genes, 
methods for identifying cell death genes from a variety 
of organisms, methods for identifying agents which 
mimic or which affect the activity of cell death genes, 
and methods and agents for altering cell death activity 
and thus, for altering the occurrence of cell death in 
a cell population or organism. 

The activity of a cell death gene is intended to 
include the activity of the gene itself and of the 
encoded products of the gene. Thus, agents and 
mutations which affect the activity of a gene include 
those which affect the expression as well as the 
function of the encoded RNA and protein. The agents 
may interact with the gene or with the RNA or protein 
encoded by the gene, or may exert their effect more 
indirectly. 
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The Cpt^p 

The cloning, sequencing and characterization of 
the C. elegans ced-4 gene are described in Example 1. 
Genomic clones were obtained from a ced-4 mutant allele. 
5 generated by transposon tagging. A subclone containing 
as little as 4.4 kb of wild-type genomic DNA was shown 
to complement the ced-4 mutant phenotype (see Table 1; 
tables are located at the end of the Detailed 
Description) . 

10 A 2.2 kb mRNA was identified as the ced-4 

transcript. The transcript was shown to be present at 
normal levels in a ced-3 mutant, suggesting that ced-3 
is not a transcriptional regulator of ced-4 gene 
expression. Furthermore, the 2.2 kb transcript was 

15 shown to be expressed primarily during embryogenesis. 
This is consistent with the observation that 113 of the 
131 programmed cell deaths in C. elegans are embryonic 
(Sulston and Horvitz, Dev. Biol. 82:110-156 (1977); 
Sulston et al., Dev. Biol. 200:64-119 (1983)). 

20 cDNA clones were further obtained and sequenced. 

Analysis of the cDNA and its encoded product indicates 
that the putative Ced-4 protein is 549 amino acids in 
length (Figure 1; Seq. ID #2) and about 62,877 in 
relative molecular mass. The Ced-4 protein is highly 

25 hydrophilic, with a predicted pi of 5.12; there are no 
obvious transmembrane regions. The longest hydrophobic 
region is a segment of 12 amino acids from residues 382 
to 393. 

Sequence analysis of the ced-4 genomic clone and 
30 comparison with the cDNA sequence revealed that the 
ced-4 gene contains 7 introns with sizes ranging from 
44 bp to 557 bp (Figure 2) . 

The nucleotide sequences of eight EMS-induced 
ced-4 mutations were also determined. Of the eight 
35 mutations, one results in a single amino acid 
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substitution and the other seven appear to prevent 
either ced-4 RNA splicing or completion of Ced-4 
protein synthesis (Figure 2 and Table 2) . These seven 
mutations establish the null phenotype of the ced-4 
5 gene, confirming that ced-4 function is not essential 
for viability. 

Two regions of the inferred Ced-4 protein have 
sequence similarity to known calcium-binding domains 
(Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 

10 52:499-510 (1987)), suggesting that Ced-4 activity and 
hence, programmed cell death may be modulated by 
calcium (see Figure 3 and Example 1) . Calcium has been 
implicated as an essential mediator of cell death in 
other organisms under a variety of conditions. For 

15 example, extracellular calcium is required for 

glucocorticoid- induced thymocyte death (Cohen and Duke, 
J. Immunol. 132:38-42 (1984)), for the deaths of adult 
rat hepatocytes induced by certain toxins in vitro 
(Schanne et al ./ Science 206:700-702 (1979)), for 

20 agonist-induced muscle degeneration in mice (Leonard 
and Salpeter, J. Cell Biol. 82:811-819 (1979)) and for 
neuronal cell death caused by oxygen deprivation or 
excitotoxicity (Coyle et al., Neurosci . Res. Prog. 
Bull. 15:331-427 (1981); Choi, J. Neurosci. 7:369-379 

25 (1987), Choi, Trends Neurosci. 11:465-469 (1988)). It 
is possible that programmed cell death is initiated 
during C. elegans development by an increase in 
intracellular calcium, which activates the Ced-4 
protein to become cytotoxic. On the other hand, 

30 certain cells seem to be protected against cell death 
by calcium (e.g., Koike et al., Proc. Natl. Acad. Sci. 
USA 85:6421-6425 (1989); Collins et al . , J. Neurosci. 
11 : 25^2-2587 (1991)), suggesting that increases in 
intracellular calcium levels may inhibit the activity 



to 



• * 



-10- 

of the Ced-4 protein and thereby prevent programmed 
cell death. 

The level of the ced-4 transcript in eggs is about 
20% that of the act in 1 transcript, which is relatively 
5 abundant (Edwards and Wood, Dev. Biol. 97:375-390 
(1983)). This level seems higher than might be 
expected if ced-4 were expressed only in dying cells, 
since in an embryo there are usually no more than two 
or three cells dying at the same time. These 
10 considerations suggest that ced-4 might be transcribed 
not only in dying cells but in other cells as well. 
Perhaps Ced-4 activity, at least during embryonic 
development, is regulated at a post-transcriptional 
level. For example, the Ced-4 protein might have to 
15 interact with other proteins or other factors (such as 
calcium) to cause cell death. Since the ced-3 gene is 
also essential for programmed cell death in C. elegans , 
one possibility is that the activity of the Ced-4 
W protein is dependent upon ced-3 function. 

frj The ced-3 Gene 

The cloning, sequencing and characterization of 
the ced-3 gene are described in Example 2. The ced-3 
gene was cloned by mapping DNA restriction fragment 
25 length polymorphisms (RFLPs) and chromosome walking. A 
7.5 kb fragment of genomic DNA was shown to complement 
ced-3 mutant phenotypes. A 2.8 kb transcript was 
further identified. The ced-3 transcript was found to 
be most abundant in embryos, but was also detected in 
30 larvae and young adults, suggesting that ced-3 is not 
only expressed in cells undergoing programmed cell 
death. 

A 2.5 kb cDNA corresponding to the ced-3 mRNA was 
sequenced. The genomic sequence was also determined 
35 (Figure 4; Seq. ID #18) and a comparison with the cDNA 
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sequence revealed that the ced-3 gene has 8 introns 
which range in size from 54 to 1195 bp (Figure 5A) . 
The four largest introns as well as sequences 5' of the 
start codon contain repetitive elements, some of which - 
have been previously characterized in non-coding 
regions of other C. elegans genes such as fem-1 (Spence 
et al., Cell 60:981-990 (1990)), lin-12 (J. Yochem, 
personnal communication), and myoD (Krause et al., Cell 
63:907-919 (1990)). The transcriptional start site was 
also mapped, and the ced-3 transcript was found to be 
trans-spliced to a C. elegans splice leader, SL1. 

Twelve EMS-induced ced-3 alleles were also 
sequenced. Eight of the mutations are missense 
mutations, two are nonsense mutations, and two are 
putative splicing mutations (Table 3). The molecular 
nature of these mutations, together with results of 
genetic and developmental analyses of nematodes 
homozygous for these mutations, indicate that, like 
ced-4, ced-3 function is not essential to viability. 
In addition, 10 out of the 12 mutations are clustered 
in the C-rterminal region of the gene (Figure 5B) , 
suggesting that this portion of the encoded protein may 
be important for activity. 

The ced-3 gene encodes a putative protein of 503 
amino acids (Figure 4; Seq. ID #19). The protein is 
very hydrophilic and no significantly hydrophobic 
region can be found that might be a transmembrane 
domain (Figure 6) . One region of the ced-3 protein is 
very rich in serine. Sequence comparison of two 
additional ced-3 genes from related nematodes, C. 
jbrigrgrsae and C. vulgaris, suggests that the exact 
sequence in this serine-rich region may not be 
important but that the serine-rich feature is (Figure 
7; Seq. ID #19-21). This hypothesis is supported by 
the analysis of ced-3 mutations: none of 12 EMS-induced 
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cecf-3 mutations is in the serine-rich region (Figure 
5B) . 

The conservation of the serine-rich feature among 
the ced-3 genes of different nematodes suggests that 
5 the serine-rich region may act in semi-specific 

protein-protein interactions, similar to acid blobs in 
transcription factors and basic residues in nuclear 
localization signals. In all these cases, the exact 
primary sequence is not important. 

10 It is possible that the serine residues in the 

Ced-3 and Ced-4 proteins may be targets for a Ser/Thr 
kinase, and that the activity of these proteins may be 
regulated post-translationally by protein 
phosphorylation. McConkey et al . (J. Immunol., 

15 145:1227-1230 (1990)) have shown that phorbol esters, 
which stimulate protein kinase C, can block the death 
of cultured thymocytes induced by exposure to Ca ++ , 
ionophores or glucocorticoids (Wyllie, Nature 284:555- 
556 (1980); Wyllie et al., J. Path. 142:67-77 (1984)). 

20 It is possible that protein kinase C may inactivate 
certain cell death proteins by phosphorylation, and 
thus, inhibit cell death and promote cell 
proliferation. Several agents that can elevate 
cytosolic cAftP levels have been shown to induce 

25 thymocyte death, suggesting that protein kinase A may 
also play a role in mediating thymocyte death. Further 
evidence suggests that abnormal phosphorylation may 
play a role in the pathogenesis of certain cell- 
degenerative diseases. For example, abnormal 

30 phosphorylation of the microtubule-associated protein 
Tau is found in the brains of Alzheimer's disease and 
Down's syndrome patients (Grundke-Iqbal et al., Proc. 
Natl. Acad. Sci. USA 83:4913-4917 (1986); Flament et 
al., Brain Res. 515:15-19 (1990)). Thus, it is 

35 possible that phosphorylation may have a role in 
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regulating programmed cell death in C. elegans. This 
is consistent with the fairly high levels of ced-3 and 
ced-4 transcripts which suggest that transcription 
regulation alone may be insufficient to regulate 
5 programmed cell death. 

Structurally and Functionally Related Genes 

As a result of the work described herein , it is 
possible to identify genes which are structurally 
10 and/or functionally related to ced-3 or ced-4. Such 
genes are expected to be found in a variety of 
organisms, including vertebrates (e.g., mammals and 
particularly humans), invertebrates (e.g., insects), 
□ microbes (e.g., yeast) and possibly plants, 

g 15 Structurally related genes refer herein to genes which 

m have some structural similarity to the nucleotide 

f: sequences (genomic or cDNA) of one or both of the ced-3 

Si or genes, or whose encoded proteins have some 

^ similarity to one or both of the amino acid sequences 

u 20 of the Ced-3 or Ced-4 proteins. Functionally related 

[U genes refer to genes which have similar activity to 

y that of ced-3 and ced-4 in that they cause cell death. 

O Such genes can be identified by their ability to 

complement ced-3 or ced-4 mutations in bioassays, as 
25 described below. 

Previous studies are consistent with the 
hypothesis that genes similar to the C. elegans ced-3 
and ced-4 genes may be involved in the cell deaths that 
occur in both vertebrates and invertebrates. Some 
30 vertebrate cell deaths share certain characteristics 
with the programmed cell deaths in C. elegans that are 
controlled by ced-3 and ced-4. For example, up to 14% 
of the neurons in the chick dorsal root ganglia die 
immediately after their births, before any signs of 
35 differentiation (Carr and Simpson, Dev. Brain Res. 




it 



-14- 



10 

iU 15 

i - 

SJ 

m 
□ 

-.3 

25 
30 
35 



2:57-162 (1982)). Genes like ced-3 and ced-4 could 
well function in this class of vertebrate cell death. 
In addition, genes related to ced-3 and ced-4 could 
function in many other types of vertebrate cell death 
processes, including those involving cells that die 
long after their births and those that die as a result 
of stress (e.g., oxygen deprivation) or disease. 

Genetic mosaic analysis has suggested that the 
ced-3 and ced-4 genes act within cells that undergo 
programmed cell death, rather than through cell-cell 
interactions or diffusible factors (Yuan and Horvitz, 
Dev. Biol. 138:33-41 (1990)). Many cell deaths in 
vertebrates seem different in that they appear to be 
controlled by interactions with target tissues. For 
example, it is thought that a deprivation of target- 
derived growth factors is responsible for vertebrate 
neuronal cell deaths (Hamburger and Oppenheim, , 
Neurosci. Comment. 1:39-55 (1982)); Thoenen et al., in: 
Selective Neuronal Death, Wiley, New York, 1987, Vol. 
12 6, pp. 82-85) . However, even this class of cell 
death could involve genes like ced-3 and ced-4, since 
pathways of cell death involving similar genes and 
mechanisms might be triggered in a variety of ways. 
Supporting this idea are several in vitro and in vivo 
studies which show that the deaths of vertebrate as 
well as invertebrate cells can be prevented by 
inhibitors of RNA and protein synthesis, suggesting 
that activation of genes is required for these cell 
deaths (Martin et al . , J. Cell Biol. 106:829-844 
(1988); Cohen and Duke, J. Immunol. 132:38-42 (1984); 
Oppenheim and Prevette, Neurosci. Abstr. 14:368 (1988); 
Stanisic et al . , Invest. Urol. 16:19-22 (1978); 
Oppenheim et al . , Dev. Biol. 138:104-113 (1990); 
Fahrbach and Truman, in: Selective Neuronal Death, Ciba 
Foundation Symposium, 1987, No. 126, pp. 65-81). It is 
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possible that the genes induced in these dying 
vertebrate and invertebrate cells are cell death genes 
similar to the C. elegans genes ced-3 and ced-4. 

Also supporting the hypothesis that cell death in 
C. elegans is mechanistically similar to cell death in 
vertebrates is the observation that the protein product 
of the C. elegans gene ced-9 is similar in sequence to 
the human protein Bcl-2. ced-9 has been shown to 
prevent cells from undergoing programmed cell death 
during nematode development by antagonizing the 
activities of ced-3 and ced-4 (Hengartner, et al . , 
Nature 356:494-499 (1992)). The bcl-2 gene has also 
been implicated in protecting cells against cell death. 
It seems likely that the genes and proteins with which 
ced-9 and jbcl-2 interact are similar as well. 

Genes which are structurally related to ced-3 or 
ced-4 are likely to also act as cell death genes. i 
Structurally related genes can be identified by any 
number of detection methods which utilize a defined 
nucleotide or amino acid sequence or antibodies as 
probes. .For example, nucleic acid (DNA or RNA) con- 
taining all or part of the ced-3 or ced-4 gene can be 
used as hybridization probes or as polymerase chain 
reaction (PCR) primers. Degenerate oligonucleotides 
derived from the amino acid sequence of the Ced-3 or 
Ced-4 proteins can also be used. Nucleic acid probes 
can also be based on the consensus sequences of 
conserved regions of genes or their protein products. 
In addition, antibodies, both polyclonal and 
monoclonal, can be raised against the Ced-3 and/or Ced- 
4 proteins and used as immunoprobes to screen 
expression libraries of genes. 

One strategy for detecting structurally related 
genes in other organisms is to initially probe animals 
which are taxonomically closely related to the source 
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of the probes, for example, probing other worms with a 
ced-3 or ced-4 probe . Closely related species are more 
likely to possess related genes or gene products which 
are detected with the probe than more distantly related- 
5 organisms. Sequences conserved between ced-3 or ced-4 
and these new genes can then be used to identify 
similar genes from less closely related species. 
Furthermore, these new genes provide additional 
sequences with which to probe the molecules of other 

10 animals, some of which may share conserved regions with 
the new genes or gene products but not with ced-3 , 
ced-4, or their gene products. This strategy of using 
structurally related genes in taxonomically closer 
organisms as stepping stones to genes in more distantly 

15 related organisms can be referred to as walking along 
the taxonomic tree. 

Groups of structurally related genes, such as * 
those obtained by using the above-described strategy, 
can be referred to as gene families. Comparison of 

2 0 members within a gene family, or their encoded 

products, may indicate functionally important features 
of the genes or their gene products. Those features 
which are conserved are likely to be significant for 
activity. Such conserved sequences can then be used 
25 both to identify new members of the gene family and in 
drug design and screening. For example, as described 
in Example 2, genes similar to ced-3 from two other 
species of nematodes (C. briggsae and C. vulgaris) were 
identified and characterized. Serine-rich regions were 

3 0 found in the polypeptides encoded by all three genes. 

Although the sequence of the serine-rich region was not 
well conserved, the number of serines was conserved, 
suggesting that the serine-rich feature, but not the 
exact sequence of the serine-rich region, is 
35 significant for function. 
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Functionally important regions can also be 
identified by mutagenesis • For example, inactivating 
mutations of ced-3 were found to cluster within a 
region near the COOH-terminus (Figure 5B) , suggesting 
that this region is a functionally important domain of 
the Ced-3 protein. Further mutational analyses can be 
carried out on the ced-3 and ced-4 genes; mutants with 
novel properties, as well as other regions important 
for activity, may be discovered. Mutations and other 
alterations can be accomplished using known methods, 
such as in vivo and in vitro mutagenesis (see, e.g., 
Ausubel et al . (eds.), Current Protocols in Molecular 
Biology, Greene Publishing Associates and Wiley- 
Interscience, New York) . 

Bioassa vs and Agents Which Affect the Activity of Cell 
Death Genes 

This invention further provides bioassays which 
detect the activity of cell death genes. The bioassays 
can be used to identify novel cell death genes, to 
identify mutations which affect the activity of cell 
death genes, to identify genes which are functionally 
related to known cell death genes, such as ced-3 or 
ced-4, to identify genes which interact with cell death 
genes, and to identify agents which mimic or affect the 
activity of cell death genes (e.g., agonists and 
antagonists). For example, the bioassays can be used 
to screen expression gene libraries for cell death 
genes from other organisms. 

In this bioassay, genes or agents are introduced 
into nematodes to test their effect on cell deaths in 
vivo. Wild-type, mutant, and transgenic nematodes can 

> t: i 

be used as appropriate for the effect being tested. In 
one embodiment of this bioassay, transgenic nematodes 
are produced using a candidate cell death gene, a 
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mutant cell death gene, or genes from an expression 
library, to observe the effect of the transgene on the 
pattern of programmed cell deaths during development of 
the nematode. For example, a gene which is 
structurally related to ced-3 can be used to produce a 
transgenic animal from a mutant nematode which 
underexpresses or expresses an inactivated ced-3 gene 
to see if the related gene can complement the ced-3 
mutation and is thus, functionally as well as 
structurally related to ced-3. cDNA or genomic 
libraries can be screened for genes having cell death 
activity. Genes which interact with cell death genes 
to enhance or suppress their activity can also be 
identified by this method. 

In another embodiment of the bioassay, wild-type, 
mutant, or transgenic nematodes are exposed to or 
administered peptides and other molecules in order ^to 
identify agents that mimic, increase, or decrease the 
activity of a cell death gene. For example, wild-type 
animals can be used to test agents that inactivate or 
antagonize the activity of ced-3 or ced-4 and hence, 
decrease cell deaths, or that activate or enhance ced-3 
or ced-4 activity and increase cell deaths. Mutant 
animals in which ced-3 or ced-4 is inactivated can be 
used to identify agents or genes which mimic ced-3 or 
ced-4 in causing cell deaths. Mutant animals in which 
ced-3 or ced-4 is overexpressed or constitutively 
activated can similarly be used to identify agents that 
prevent ced-3 or ced-4 from causing cell death. 
Transgenic animals in which a wild-type or mutant form 
of an exogenous cell death gene causes excess cell 
deaths due to overexpression or hyperactivity can be 
used to identify agents that inactivate or inhibit the 
activity of the transgene. Similarly, transgenic 
animals in which a wild-type or mutant form of an 



if 




-19- 



exogenous cell death gene is underexpressed or inactive 
can be used to identify agents that activate or 
increase its activity. Test molecules can be 
introduced into nematodes by microinjection, diffusion, - 
5 ingestion, shooting with a particle gun, or other 
method. 

Mutated cell death genes with novel properties may 
be identified by the above bioassay. For example, 
constitutively activated or hyperactive cell death 

10 genes may be isolated which may be useful as agents to 
increase cell deaths. Mutations may also produce genes 
which do not cause cell death but which antagonize the 
activity of the wild-type gene. 

Agents can be obtained from traditional sources, 

15 such as extracts (e.g., bacterial, fungal or plant) and 
compound libraries, or by newer methods of rationale 
drug design. Information on functionally important 
regions of the genes or gene products, gained by 
sequence and/or mutational analysis, as described 

2 0 above, may provide a basis for drug design. The 

activity . of the agents can be verified both by in vivo 
bioassay s using nematodes which express various forms 
of ced-3, ced-4, or related genes, as described above, 
and by in vitro systems, in which the genes are 

25 expressed in cultured cells, or in which isolated or 
synthetic gene products are tested directly in 
biochemical experiments. The agents may include all or 
portions of the ced-3, ced-4, or related genes, mutated 
genes, and all or portions of the gene products (RNA, 

30 including antisense RNA, and protein) , as well as 
nucleic acid or protein derivatives, such as 
oligonucelotides and peptides, peptide and non-peptide 
mimetics, and agonists and antagonists which affect the 
activity or expression of the cell death genes. The 

35 agents can also be portions or derivatives of genes or 
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gene products which are not cell death genes but which 
regulate the expression of , interact with, or otherwise 
affect the function of cell death genes or gene 
products. 

5 Uses of the Invention 

Using the above-described probes and bioassays, 
the identification and expression of ced-3 , ced-4 or 
related cell death genes in cultured cells, tissues, 
and whole organisms can be studied to gain insights 

10 into their role in development and pathology in various 
organisms. For example, the detection of abnormalities 
in the sequence, expression, or activity of a cell 
death gene or gene product may provide a useful 
diagnostic for diseases involving cell deaths. 

15 This invention further provides means of altering 

or controlling the activity of a cell death gene in a 
cell, and, thus, affecting the occurrence of cell 
death. Activity of the cell death gene can be altered 
to either increase or decrease cell deaths in a 

2 0 population of cells and, thus, affect the proliferative 
capacity or longevity of a cell population, organ, or 
entire organism. 

Agents which act as inactivators or antagonists of 
the activity of ced-3, ced-4, or other cell death genes 

25 can be used to prevent or decrease cell deaths. Such 
agents are useful for treating (i.e., for both 
preventive and therapeutic purposes) disorders and 
conditions characterized by cell deaths, including 
neural and muscular degenerative diseases, stroke, 

30 traumatic brain injury, myocardial infarction, viral 
(e.g., HIV) and other types of pathogenic infections, 
as well as cell death associated with normal aging and 
hair loss. The agent can be delivered to the affected 
cells by various methods appropriate for the cells or 
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organs being treated, including gene therapy. For 
example, anti-sense RNA encoded by all or a part of a 
cell death gene which is complementary to the mRNA can 
be delivered to a population of cells by an appropriate - 
5 vector, such as a retroviral or adenoviral vector, or 
an antagonist of cell death activity can be infused 
into a wound area to limit tissue damage* 

Methods and agents which cause or increase cell 
deaths are also useful, for example, for treating 

10 disorders characterized by an abnormally low rate or 

number of cell deaths or by excessive cell growth, such 
as neoplastic and other cancerous growth. Such methods 
and agents are also useful for controlling or 
eliminating cell populations, such as cells infected 

15 with viruses (e.g., HIV) or other infectious agents, 
cells producing autoreactive antibodies, and hair 
follicle cells. In addition, methods and agents which 
increase cell death can be used to kill or incapacitate 
undesired organisms, such as pests, parasites and 

20 genetically engineered organisms. All or portions of 
ced-3, ce.d-4, or related cell death genes, active 
mutant genes, their encoded products, agents which 
mimic the activity of cell death genes, and activators 
and agonists' of cell death genes can be used for this 

25 purpose. 

For example, cell death genes can be used to kill 
cells infected with the human immunodeficiency virus 
(HIV), and thus, prevent or limit HIV infection in an 
individual. A recombinant gene can be constructed, in 
30 which a cell death gene is under the control of a viral 
promoter which is specifically activated by a viral 
protein^ the recombinant gene is introduced into HIV 
infected cells. HIV-infected cells containing the 
viral activator protein would express the cell death 
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gene product and be killed, and uninfected cells would 
be unaffected. 

Alternatively, an antagonist of ced-3 or ced-4 
activity (such as antisense RNA) can be expressed under- 
5 the control of a viral-specific promoter and in this 
way, be used to prevent the cell death associated with 
viral (e.g., HIV) infection. 

In another example, cell death genes can be used 
as suicide genes for biological containment purposes. 

10 Genetic engineering of suicide genes into recombinant 
organisms has been reported in bacteria (Genetic 
Engineering News, Nov. 1991, p. 13): suicide genes 
were engineered to be expressed simultaneously with the 
desired recombinant gene product so that the 

15 recombinant bacteria die upon completion of their task. 
The present invention provides suicide genes which are 
useful in a variety of organisms in addition to 
bacteria, for example in insects, fungi, and transgenic 
rodents. Suicide genes can be constructed by placing 

20 the coding sequence of an exogenous cell death gene or 
an agonist of an endogenous cell death gene of the 
organism in an expression vector suitable for the 
organism. 

In addition, agents which increase cell death are 
25 useful as pesticides (e.g., anthelminthics, 

nematicides) . For example, many nematodes are human, 
animal, or plant parasites, ced-3, ced-4, or other 
nematode cell death genes, their gene products, 
mimetics, and agonists can be used to reduce the 
3 0 nematode population in an area, as well as to treat 
individuals already infected with the parasite or 
protect individuals from infection. A transgenic plant 
or animal carrying a constitutively activated ced-3 
gene, ced-4 gene, or other cell death gene specific to 



nematodes can be protected from nematode infection in 
this way. 

The subject invention will now be illustrated by 
the following examples, which are not intended to be 
limiting in any way. 2 

EXAMPLE 1 

CLONING. SEQUENCING AND CHARACTERIZATION OF 
THE CED-4 GENE 

MATERIALS AND METHODS 

General Methods and Strains 

Techniques used for the culturing of C. elegans 
were essentially as described by Brenner (Genetics 
77:71-94 (1974)). All strains were grown at 20°C. DNA 
was prepared from worms grown on Petri dishes , 
containing ajgarose seeded with E. coli strain HB101. 
RNA was prepared from mass cultures grown in liquid. 
Usually, the bacterial pellet from a 2 L overnight 
culture of E. coli HB101 grown in superbroth (12 g 
Bacto-tryptone, 24 g yeast extract, 8 ml 50% glycerol, 
900 ml H 2 0; after autoclaving, 100 ml 0.17 M KH 2 HP0 4 
and 0.72 K 2 HP0 4 were added) was resuspended in 500 ml S 
basal medium (Brenner, 1974 supra) , and worms were 
added from one or two 10 cm Petri dishes in which the 
bacterial lawns had just been consumed. Worms were 
harvested about 4-5 days later by centrifugation and 
washed in M9 buffer (Brenner, 1974 supra) . The yield 
was about 5-10 ml of packed worms. 

Nomarski differential interference contrast 
microscopy was used to examine individual cells in 
living nematodes (Sulston and Horvitz, Dev. Biol. 
82:110-156 (1977)). Methods for scoring the Ced 
phenotype of ced-I, ced-4 and ced-1; ced-4 double 
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mutants have been described by Ellis and Horvitz, (Cell 
44:817-829 (1986)) and by Yuan and Horvitz, (Dev. Biol. 
138:33-41 (1990) ) . 

The wild-type parent of all mutant strains used in . 
these experiments was C. elegans variety Bristol strain 
N2 (Brenner, 1974 supra) . The genetic markers used are 
listed below. These markers have been described 
(Brenner, 1974 supra; Hodgkin et al., in: The Nematode 
Caenorhabditis elegans, Wood and the Community of C. 
elegans Researchers (eds.), Cold Spring Harbor 
Laboratory, New York, 1988, pp. 491-584; Finney et al . , 
Cell 55:757-769 (1988)). The strain TR679 carries the 
mutator mut-2 (r459) (Collins et al., Nature 328:726-728 
(1987)). The ced-4 alleles nl894, n!920, n!947, nl948, 
n2247 , and n2273 were characterized in the present 
work. Genetic nomenclature follows the standard system 
for C. elegans (Horvitz et al., Mol. Gen. Genet. , 
175:129-133 (1979)): 

LG I: ced-l(el735) , unc-54(r323) 

LG III: unc-86(n!351) , ced-4 (n!162, n!416, 
nl894 f nl920, nl947, nl948, n2247, 
n2273, nl416 n!712, n!416 n!713) , 
unc-79 (el068) , dpy-17 (e!64 ) 
LG IV: unc-31(e928) , ced-3 (n717) 
LG V: egl-l(n986) , unc-76 (e911) 



Genomic Libraries 

A 4-6 kb size-selected phage library was 
constructed from ced-4 (nl416) DNA as follows. Genomic 
DNA was digested with Hindlll and run on a low-melting 
30 agarose gel. DNA migrating within the 4-6 kb size 
range was excised, and the low-melting agarose was 
removed by phenol extraction and precipitation 
(Maniatis et al., Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory (1983)). These 
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DNA fragments were ligated to tfindlll-digested DNA from 
phage A.NM1149 (Murray, Phage Lambda and Molecular 
Cloning, Cold Spring Harbor Laboratory, 1983, pp. 395- 
432) . The product DNA was packaged with packaging 
5 extract from Promega. This library had a total of 
140,000 plaque-forming units (pfu) , of which 70% were 
recombinants, as estimated from the ratio of pfu on 
bacteria C600hfl and C600. 

The phage genomic library (provided by J. Sulston) 
10 was prepared by partial digestion of wild-type C. 
elegans genomic DNA with Sau3A and cloning into the 
Ba/nHI site of phage vector X2001 (Karn et al., Gene 
32:217-224 (1984) ) . 

Tc4 Probe 

15 The Tc4 probe used for cloning the ced-4 gene and 

for Southern blots was Tc4-nl351, which contains a ,Tc4 
element isolated from an unc-86 (n!351) mutant strain 
(Finney et al., Cell 55:757-769 (1988); Yuan et al., 
Proc. Natl. Acad. Sci. USA 88:3334-3338 (1991)). DNA 

20 was labelled with 32 P using either the nick-translation 
procedure described by Maniatis et al. (1983 supra) or 
the oligo-labelling procedure described by Feinberg and 
Vogelstein (Anal. Biochem, 132:6-13 (1983)). 

RNA Preparation, Northern Blot and Primer Extension 
25 Total C. elegans RNA was extracted using guanidine 

isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 
(1990)). Poly(A) + RNA was selected from total RNA by a 
poly (dT) -column (Maniatis et al., 1983 supra). To 
prepare stage-synchronized animals, eggs were obtained 
30 from gravid C. elegans adults grown at 20°C in liquid 
culture. A 5 - 10 ml sample of animals was treated 
with 50 ml of NaOCl/NaOH solution (10 ml NaOCl, 1 g 
NaOH, 40 ml H 2 0) for about 10 minutes with vortexing 
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until the adults were dissolved. Eggs were centrifuged 
and washed three times with M9 buffer. Isolated eggs 
were allowed to hatch in S basal medium without food 
for 14 hours at 20 °C with shaking. LI larvae were 
5 collected by low-speed centrifugation after growth on 
E. coli HB101 for 2 hours, L2 larvae after 12 hours, L3 
larvae after 24 hours, L4 larvae after 36 hours and 
adults after 48 hours. Northern blot analysis using 
DNA probes was performed essentially as described by 
10 Meyer and Casson (Genetics 105:29-44 (1986)), except 
that RNA was transferred from the gel to the Gene 
Screen filter (DuPont, Wilmington, DE) by capillary 
action. 

Quantitation of ced-4 expression during embryonic 

15 development was done by hybridizing two duplicate 

northern blots with ced-4 cDNA clone SK2-2 and with a 
genomic DNA clone for the actin 1 gene, pW-16-210, 
which hybridizes to the 3' untranslated region of the 
actin 1 transcript (Krause and Hirsh, in: Molecular 

20 Biology of the Cytoskeleton , Borisy et al . (eds.), Cold 
Spring Harbor Laboratory, 1984, pp. 287-292). The two 
probes were of the same specific activity (4 x 10 s 
counts/minute//ig) . The emission of B particles from 
the ced-4 and 1 actin 1 bands was counted using a B 

25 counter (Betagen, Waltham, MA). The readings were 7.7 
counts/minute for the actin 1 band and 1.4 counts/ 
minute for the ced-4 band. 

The primer extension protocol was that of Sambrook 
et al. (Molecular Cloning: A Laboratory Manual, 2nd 

30 edition, Cold Spring Harbor Laboratory, 1989, pp. 7.79- 
7.83), using the primer ATTGGCGATCCTCTCGA (Seq. ID 
#22) . To define the lengths of the reaction products, 
a sequencing reaction using this primer and C10D8-5 as 
template was run adjacent to the product of the primer 

35 extension reaction in the sequencing gel. 
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Direct ion of Transcription 

The direction of transcription was determined by 
hybridizing northern blots with single-stranded RNA 
probes. The Bluescribe plasmid containing the insert 
pnl416 was linearized by digestion with either BamHI or 
Jfindlll, which cleaved at one or the other end of the 
insert. The linearized product was transcribed using 
T3 or T7 RNA polymerase, respectively, generating RNA 
from each strand. These RNA products were used to 
probe Northern blots according to a protocol developed 
by Z. Liu and V. Ambros: Filters were prehybridized in 
50% formamide, 50 mM sodium phosphate (pH 6.5), 5 x 
SSC, 8 X Denhardt's, 0.5% SDS, 250 fig /ml salmon sperm 
DNA and then hybridized with probe at 55 °C and washed 
in 4 x SSC, 0.1% SDS at 60°C 3 times for 20 minutes 
each and then in 2 x SSC, 0.1% SDS once at 60 °C for 20 
minutes . Northern blot experiments showed that the 
single-stranded RNA probe transcribed by T3 RNA 
polymerase hybridized to the 2.2 kb ced-4 mRNA, while 
the probe made by T7 RNA polymerase did not. This 
result indicates that the direction of the 
transcription is from the BamHI site toward the Hindlll 
site of pnl416. 

Determination of DNA Sequence 

For determining DNA sequences, serial deletions 
were made according to Henikoff (Gene 28:351-359 
(1984)). DNA sequences were determined using Sequenase 
and protocols obtained from US Biochemicals (Cleveland, 
OH) . The ced-4 DNA sequence was confirmed by 
sequencing both strands of cDNA and genomic DNA clones. 

Cloning of the Cosmid Fragment C10D8-5 

The cosmid C10D8 was digested with EcoRI. Two 
EcoRl fragments of 2.2 kb (r5) and 2.4 kb (r7) , both of 
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which hybridized to a mixture of ced-4 cDNA subclones 
SK2-1 and SK2-2, were isolated. r7, which hybridized 
to SK2-1, which contains the 3' half of ced-4 cDNA 
clone SK2, was cloned into the EcoRI site of plasmid 
pBSKII (Stratagene) • The EcoRI site at the 3' end of 
r7 was deleted by digesting with Styl, which cut once 
at 0.2 kb from the 3' end of the insert, and Sail, 
which cut once in the polylinker, and then religating. 
The deleted r7 plasmid was linearized with EcoKL and 
ligated with EcoRI-digested r5, which hybridized to 
Sk2-2, the 5' half of ced-4 cDNA SK2. Clones were 
analyzed for the correct orientation of the r5 insert 
based on the cDNA restriction map. One such correctly 
oriented clone was named C10D8-5. 

Microinjection and Transformation 

The procedure for microinjecting DNA into the, 
gonad to obtain germline transf ormants was basically 
that of Fire (EMBO J. 5:2673-2680 (1986)) with 
modifications introduced by J. Sulston. Cosmid DNA to 
be injected was purified twice using CsCl-gradient 
centrifugation (Maniatis et al., 1983 supra). Plasmid 
DNA to be injected was prepared by alkaline minipreps 
(Maniatis et'al., 1983 supra). DNA was treated with 
RNAase A (37 °C, 30 minutes) and then with proteinase K 
(55°C, 30 minutes), extracted with phenol and then 
chloroform, precipitated twice (first in 0.3 M sodium 
acetate and then in 0.1 M potassium acetate, pH 7.2), 
and resuspended in 5 ul of injection buffer (Fire, 1986 
supra) . DNA concentrations used for injection were 
0.1-1.0 mg/ml. 

All transformation experiments used a ced-i; 
ced-4 (nll62) ; unc-31 strain as the recipient. The 
expression of the Ced-4 phenotype was quantified by 
counting the number of cell corpses in the heads of 
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young LI animals. The cosmid C10D8 or plasmid 
subclones of C10D8 were mixed with cosmid C14G10, which 
contains the wild-type unc-32(+) gene, at a ratio of 
2:1 or 3:1 to increase the likelihood that a 
5 phenotypically non-Unc transformant would contain the 
cosmid or plasmid being tested. Generally, 20-3 0 
animals were injected in one experiment. Non-Unc Fl 
progeny of injected animals were isolated three to four 
days later. About 1/2 to 1/3 of the non-Unc progeny 
io transmitted the non-Unc phenotype to their progeny and 
could be established as lines of transf ormants . Young 
LI non-Unc progeny of such non-Unc transf ormants were 
examined using Nomarski optics to determine the number 
of cell corpses present in the heads. 

15 Ced-4 Fusion Pro tein and Antibody Preparation 

To express a Ced-4 fusion protein in E. coli, n a 
clone containing both the 5' and 3' halves of the ced-4 
cDNA (SK2-2 and SK2-1) in the expression vector pET-5a 
(Rosenberg et al., Gene 56:125-135 (1987)) was 

20 constructed. The fusion protein expressed by this 

vector was expected to include 11 amino acids of phage 
T7 gene 10 protein, 5 amino acids of linker and the 546 
amino acids encoded by ced-4 cDNA SK2. The pJ76 
plasmid, which encodes this fusion protein, was 

25 transformed into bacterial strain BL21. ced-4 fusion 
protein was produced by this transformed strain, as 
expected, and subjected to electrophoresis on a 
polyacrylamide gel. A band, with mobility equivalent 
to about 64 x io 3 Mr, specific to the transformed 

30 strain was exercised and used to immunize three 

rabbits. Sera from all three rabbits tested positive 
on western blots (Towbin et al., Proc. Natl. Acad. Sci. 
USA 76:4350-4354 (1979)). These sera were purified 
using immunoblots (Harlow and Lane, Antibodies: A 
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Laboratory Manual, Cold Spring Harbor Laboratory , 
1988) . 



Cloning of th e ced-4 Gene bv Transposon Tagging 

The ced-4 allele nl416 in the C. elegans strain 
TR679 was isolated, which carries the mutator mut- 
2(r459) and shows an elevated frequency of 
transposition elements (Collins et al., Nature 328:726- 
728 (1987); Yuan et al., Proc. Natl. Acad. Sci. USA 
88:3334-3338 (1991)). The ced-4 (nl416) mutation is 
closely linked to a newly transposed copy of the C. 
elegans transposon Tc4 (Yuan et al., 1991 supra). 
Using Tc4 as a probe, this novel Tc4 element and its 
flanking region was cloned as a 5 kb Hindlll fragment 
from a 4-6 kb size-selected ced-4 (n!416) genomic phage 
library. A 3 kb adjacent to this Tc4 element was 
isolated by digesting the 5 kb Hindlll fragment with 
BamHI. This 3 kb fragment, called pnl416, was cloned 
into the Bluescribe M13+ plasmid vector (Stratagene) . 

When used as a probe on Southern blots, pnl416 
hybridized to a 3.4 kb tfindlll fragment in DNA of wild- 
type (strain N2) and two non-Ced revertants of 
ced-4 (nl416) , ced-4 (n!416 n!712) and ced-4 (nl416 nl713) 
(Yuan and Horvitz, Dev. Biol. 138:33-41 (1990)), and a 
5 kb ffindlll fragment in ced-4 (nl416) animals. The 
hybridizing band in ced-4 (n!416) DNA is 1.6 kb larger 
than that of the wild-type or the revertants, 
indicating that an insertion of this size is present in 
the ced-4 (n!416) strain and is deleted in both 
revertants. These observations indicate that the Tc4 
insertion in ced-4 (n!416) animals is responsible for 
their Ced-4 mutant phenotype and suggest that pnl416 
contains at least part of the ced-4 gene. 
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To isolate additional genomic DNA from the region 
of this Tc4 insertion, pnl416 was used to probe a C. 
elegans Bristol N2 genomic DNA phage library. Five 
phage clones with inserts of 10 to 

15 kb were isolated and shown to share a 3 kb BamRI- 
Hindlll fragment that hybridized to pnl4l6. These 
phage clones were used to identify cosmids that 
hybridized to them and that were members of a 600 kb 
contig of overlapping cosmids (Coulson et al., Proc. 
Natl. Acad. Sci . USA 83:7821-7825 (1986)). By using 
the phage clones as probes to hybridize to Southern 
blots, a cosmid C10D8 was identified as containing all 
regions of genomic DNA present in all five phage clones 
and in pnl416. 

The ced-4 Mutant Phenotype Can Be Rescued by a 4 . 4 kb 
DNA Fragment 

To identify ced-4 (+) DNA capable of complementing 
the Ced-4 mutant phenotype, the cosmid C10D8 was 
injected into the oocytes of ced-4 (nll62) animals. To 
facilitate the identification of transgenic animals, a 
mutation in the unc-31 gene, which affects locomotion, 
was included as a marker for co-transformation (Kim and 
Horvitz, Genes & Dev. 4:357-371 (1990)). Cosmid 
C14G10, which contains the wild-type allele of unc-31 
and does not have Ced-4-rescuing activity was 
coinjected with cosmid C10D8 into ced-1 (e!735) ; 
unc-31 (e928) ; ced-4 (n!162) animals. The ced-1 mutation 
was included to facilitate the scoring of the ced-4 
mutant phenotype (Ellis and Horvitz, Cell 44:817-829 
(1986)). Specifically, when a cell undergoes 
programmed cell death in C. elegans, its corpse is 
quiclciy engulfed and destroyed by a neighboring cell 
(Robertson and Thomson, J. Embryol . Exp. Morph. 67:89- 
100 (1982); Sulston et al., Dev. Biol. 100:64-119 
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(1983)). A ced-2 mutation prevents this engulfment, 
allowing the cell corpse to remain intact (Hedgecock et 
al., Science 220:1277-1280 (1983)). Thus, in a first 
or second stage (LI or L2) ced-1 mutant larva, many 
5 cell corpses are present and can be easily visualized 
using Normaski optics, ced-4 mutations prevent cell 
death and the appearance of these corpses. Thus, 
suppression of the Ced-4 mutant phenotype by a wild- 
type ced-4 gene can be observed and readily quantified 
10 in a ced-l mutant background based on an increase in 
the number of visible cell corpses. 

From one such microinjection experiment, three 
non-Unc animals rescued for the Unc-31 mutant phenotype 
were picked from among the Fl progeny, and from one of 
15 them a line of non-Unc transformants was obtained. No 
true-breeding non-Unc animals could be isolated from 
this line: about 25% of the progeny of all non-Unc, 
animals were Unc. Since no inviable zygotes were 
observed among the progeny of these non-Unc animals, 
20 this transformant did not carry a recessive lethal 
[U insertion mutation. Rather, it seems likely that the 

rj injected DNA was maintained as an extrachromosomal 

yQ array that was segregated to only some gametes, as has 

been reported previously for many other C. elegans 
25 transgenic strains (e.g., Stinchcomb et al., Mol . Cell 
Biol. 82:110-156 (1985); Fire, EMBO J. 5:2673-2680 
(1986); Way and Chalfie, Cell 54:5-16 (1988)). This 
putative extrachromosomal array was named nExl. Young 
LI progeny of nExl -containing animals were examined 
30 using Nomarski optics for the Ced-4 phenotype. 

Young LI ced-1 animals have an average of 23 cell 
corpses in the head, while ced-1 (el735) ; ced-4(n!162) 
animals have an average of 0.6 cell corpses (Ellis and 
Horvitz, Cell 44:817-829 (1986)). Young LI ced-1; 
35 ced-4 (n!162) ; nExl animals had an average of nine cell 
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corpses in the head. These results indicate that 
cosmid C10D8 restored significant, but not total, 
ced-4 (+; activity in the transf ormants. 

To delineate the ced-4 gene within C10D8, various 
5 subclones of C10D8 were injected into ced-4 mutant 
animals and tested for their ability to rescue the 
Ced-4 mutant phenotype (Table 1) . The smallest 
subclone plasmid that could rescue the ced-4 phenotype 
as effectively as cosmid C10D8 was a 4.4 kb fragment, 

10 called C10D8-5. C10D8-5 and the unc-31 (+) -containing 
cosmid C14G10 were coinjected into ced-1/ unc-31; 
ced-4(nll62) animals. Two lines of non-Unc 
transf ormants were isolated. Since these animals 
continued to segregate Unc animals and did not produce 

15 inviable zygotes, both appeared to carry 

extrachromosomal arrays, which were designated nEx7 and 
nEx8. Young LI animals from these transf ormant strains 
had an average of 11.5 cell corpses in their heads, 
indicating that plasmid C10D8-5 restored ced-4 (+) 

20 activity as well as did cosmid C10D8 (Table 1) . 

Identification of a ced-4 Transcript 

Restriction sites of plasmid C10D8-5 (which can 
rescue the Cfed-4 phenotype) and pnl416 (which contains 
sequences adjacent to the Tc4 insertion site) were 
25 mapped. C10D805 was found to overlap with 2 kb of 
sequence in pnl416, including the Tc4 insertion site 
(Figure 8) . 

In Northern blot experiments, both pnl416 and 
C10D8-5 were used to probe poly (A) + RNA populations of 
30 mixed developmental stages of wild-type (strain N2), 
ced-4 (nl416) , and ced-4 (nl416 nl712) and ced-4 (nl416 
nl713) revertant animals. pnl416 hybridized to a 2.2 
kb transcript and an 0.9 kb transcript in RNA from N2 
animals, and a 3 kb transcript, a transcript slightly 
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larger than the wild-type 2.2 kb transcript, and a 
transcript slightly smaller than the wild-type 0.9 kb 
transcript in ced-4(nl416) animals. The 3.8 kb RNA 
contained Tc4 sequence (see below) , suggesting that 
this RNA resulted from the insertion of the 1.6 kb Tc4 
sequence into the ced-4 sequence encoding 2.2 kb 
transcript. The transcript slightly larger than the 
2.2 kb wild-type transcript did not contain Tc4 
sequence. This ced-4 (nl416) RNA might have been an 
aberrant transcript containing sequences adjacent to 
the ced-4 gene: when pnl416 was used as a probe, the 
wild-type 2.2 kb and the slightly larger transcript in 
this mutant were relatively similar in intensities, 
whereas when ced-4 cDNA clone SK2-1 was used as a 
probe, this mutant transcript was not detected (see 
below) . These observations indicate that the 
ced-4 (nl416) 2.2 kb transcript contains sequences from 
the ced-4 region but does not contain sequences 
corresponding to at least the 3' half of the ced-4 
mRNA. The two revertants of ced-4 (nl416) , ced-4 (nl416 
nl712) and ced-4 (nl416 nl713) , contained both 2.2 kb 
and 0.9 kb transcripts with similar sizes to the wild- 
type transcripts. Thus, both the 2.2 kb and the 0.9 kb 
transcripts were altered in ced-4 (nl4l6) animals, and 
both were restored in the two non-Ced revertants. 

To determine if any of the transcripts contains 
Tc4 sequence, the Northern blots were probed with 
Tc4 -nl351, which contains the 1.6 kb Tc4 element 
present in the Tc4-induced mutant unc-86 (nl351) as well 
as 4 kb of unc-86 sequences. Tc4-n2352 hybridized both 
to a 3.8 kb transcript of the Tc4-induced mutant 
ced-4 (nl416) and to a 1.5 kb unc-68 transcript in both 
ced-4 (nl416) and N2 animals. 

To determine whether one or both of the 2.2 kb and 
0.9 kb transcripts are encoded by ced-4, subclone 
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C10D8-5, which rescued the Ced-4 phenotype, was used to 
probe the Northern blots. C10D8-5 detected the wild- 
type 2.2 kb transcript, the ced-4 (nl4 16) transcript 
slightly larger than the 2.2 kb transcript, and the 
5 ced-4 (nl416) 3.8 kb transcript. C10D8-5 did not 
hybridize to the 0.9 kb transcript, indicating that 
this transcript is unlikely to be encoded by ced-4. 
C10D8-5 also detected a 1.4 kb transcript, which was 
not altered by the Tc4 insertion in ced-4 (nl416) 

10 animals. Only a 470 bp J£coRI-StuI fragment at one end 
of C10D8-5 hybridized to this 1.4 kb RNA. Since 
C10D8-5 did not contain the complete coding region for 
this RNA, and since this RNA was unaffected in 
ced-4 (nl416) animals, this 1.4 kb RNA seems unlikely to 

15 be a ced-4 transcript. The relationships among cosmid 
C10D8-5, pnl416 and the 0.9 kb, 1.4 kb and 2.2 kb 
transcripts are summarized in Figure 8. 

On Northern blots probed with the ced-4 cDNA clone 
SK2-1, the level of the 2.2 kb transcript showed 

20 significant reduction in all three independently 

derived EMS-induced ced-4 mutants examined, strongly 
supporting the hypothesis that this 2.2 kb transcript 
is a ced-4 transcript. Total RNA from N2, 
ced-4 (nll62) / ced-4 (nl416) , ced-4 (nl894 ) and 

25 ced-4 (nl920) eggs was probed with 32 P-labelled ced-4 
cDNA SK2-1. An act in 1 probe (Krause and Hirsh, in: 
Molecular Biology of the Cytoskeleton , Borisy et al . 
(eds.)r Cold Spring Harbor Laboratory, 1984, pp. 287- 
292) was used as an internal control for the amount of 

30 RNA loaded in each lane. The ratios of the intensity 
of the ced-4 band to that of actin band in N2, nll62, 
nl416 and nl894 were 0.5, 0.17, 0 and 0.12, 
respectively. A Northern blot of poly (A) + RNA from 
stage-synchronized animals was probed with pnl416, 

35 which hybridizes both to the 2.2 kb ced-4 transcript 
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and to a 0.9 kb transcript. The 0.9 kb transcript 
seems to be expressed mostly in eggs and adults. The 
presence of RNA in all lanes was confirmed by loading 
1/10 of each sample on another gel and probing a 
Northern blot from this gel using the C. elegans actin 
1 gene (Krause and Hirsh, 1984 supra) . That all of 
these distinct ced-4 mutations cause reduced levels of 
a ced-4 transcript could reflect either instability of 
all three mutant transcripts or a role for ced-4 in 
regulating its own expression. 

Based upon these results, it can be concluded that 
the 2.2 kb RNA is a ced-4 transcript. It is not known 
why the 0.9 kb RNA is also altered in ced-4 (nl416) 
animals. Perhaps transcription of the 0.9 kb RNA is 
initiated incorrectly as a consequence of the nearby 
Tc4 element. 

ced-4 Expression is Primarily Embryonic 

A Northern blot containing RNAs from stage- 
synchronized animals of different developmental stages 
probed with pnl416 showed that the 2.2 kb ced-4 
transcript was expressed primarily during 
embryogenesis. This result is consistent with the 
observation that 113 of the 131 programmed cell deaths 
in the C. elegans hermaphrodite are embryonic (Sulston 
and Horvitz, Dev. Biol. 82:110-156 (1977); Sulston et 
al., Dev. Biol. 200:64-119 (1983)). The 2.2 kb RNA was 
relatively abundant during embryonic development. The 
0.9 kb transcript was expressed mostly in eggs and 
adults. The presence of RNA in all lanes was confirmed 
by loading 1/10 of each sample on another gel and 
probing a Northern blot from this gel with the C. 
elegans actin 1 gene (Krause and Hirsh, 1984 supra) . 
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The ced-4 Transc ript is Present in a ced-3 Mutant 

The activities of both ced-3 and ced-4 are 
required for programmed cell death (Ellis and Horvitz, 
Cell 44:817-819 (1986)). One possibility is that one 
of these genes positively regulates the expression of 
the other. For this reason, a Northern blot of wild- 
type strain N2 and ced-3 (n717) poly (A) + RNA was probed 
with pnl416. This experiment showed that the 2.2 kb 
ced-4 transcript was present at an apparently normal 
level in this ced-3 mutant. Thus, the activity of the 
ced-3 gene is unlikely to be necessary for the 
expression of the ced-4 2.2 kb transcript. 

Identification of ced-4 cDNA Clones 

To isolate cDNA clones of ced-4, pnl416 was used 
to probe a C. elegans cDNA phage library made from 
wild-type strain N2 mixed-stage RNA (Kim and Horvitz, 
Genes & Dev. 4:357-371 (1990)). Two cDNA clones were 
isolated. The two cDNA clones (named SKI and SK2) 
hybridized to the 2.2 kb ced-4 transcript. Both are 
about 1.8 kb in size, and both contain one 0.8 kb and 
one 1.0 kb EcoRI fragment. These EcoRI fragments were 
subcloned into plasmid vector Bluescribe M13+ 
(Stratagene) i The two subclones derived from SKI were 
named SK1-1 and SK1-2, and the two subclones derived 
from SK2 were named SK2-1 and SK2-2. The restriction 
maps of the SKI- and SK2-derived clones were the same. 
Sequence analysis of the ends of the four cDNA 
subclones confirmed the equivalence of the SKI and SK2 
clones, except that SK1-2 contains a poly (A) sequence 
of more than 50 bp at its 5 ' end. This poly (A) 
sequence is probably a cDNA cloning artifact, since 
SK1-2 contains the 5' half of the cDNA (see below). 
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The ced-4 Sequence 

The DNA sequence of the SK2 1.8 kb cDNA clone was . 
determined. This sequence includes an open reading 
frame encoding 546 amino acids (Figure 1; Seq. ID #2), 
5 which is consistent with the results of Northern blot 
analysis using single-stranded RNA probes. An ochre 
termination codon (TAA) is located in-frame near the 3' 
end, indicating that the 3' end of the 2.2 kb 
transcript is most likely included in this cDNA. The 

10 open reading frame extends to the 5' end of the 1.8 kb 
cDNA, suggesting that this cDNA might lack the 5' end 
of the ced-4 coding region. 

A primer extension experiment was performed to 
determine the ced-4 transcription initiation site(s) 

15 using the primer ATTGGCGATCCTCTCGA (Seq. ID #23) and 
C10D8-5 as template. A major transcriptional 
initiation site was identified at 54 bp before (5' of) 
the beginning of the ced-4 cDNA SK2 and a minor 
initiation site at 54 bp after (3 ' of) the beginning of 

20 this cDNA (Figure 1) . The first AUG codon after the 
presumptive major start site is located at 9 bp before 
the beginning of the cDNA (Figure 1) . If this site is 
used to initiate protein synthesis, the Ced-4 protein 
would be 549 -'amino acids in length. The first AUG 

25 codon after the presumptive minor start site is located 
at 130 bp after the beginning of the cDNA. If this 
site is used, the Ced-4 protein would be 503 amino 
acids in length. Preliminary results using an anti- 
Ced-4 antibody raised against a Ced-4 fusion protein 

30 showed that endogenous Ced-4 protein is slightly 

smaller in molecular weight than a Ced-4 fusion protein 
of 562 amino acids expressed in E. coli. Thus, most 
Ced-4 protein is probably initiated near the start of 
the cDNA and is presumably 549 amino acids in length 

35 and 62,977 in relative molecular mass. The direction 
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of the open reading frame is consistent with the 
direction of transcription, as demonstrated by probing 
Northern blots with single-stranded RNA probes. The 
presumptive Ced-4 protein is highly hydrophilic, with a. 
5 pi of 5.12. The longest hydrophobic region is a 
segment of 12 amino acids from residues 382 to 393. 

A Western blot of wild-type strain N2 mixed-stage, 
ced-4 (nl416) mixed-stage, wild-type egg, and 
bacterially expressed protein (pJ76) was probed using 

10 anti-Ced-4 antibody. Ced-4 fusion protein (pJ76) was 
made by cloning ced-4 cDNA SK2 into the T7 expression 
vector pET-5a (Rosenberg et al., Gene 56:125-135 
(1987)), so that 546 amino acids of Ced-4 sequence were 
fused to 11 amino acids of T7 gene 10 protein and 5 

15 amino acids of linker sequence. This Ced-4 fusion 
protein is similar in relative molecular mass to the 
endogenous Ced-4 protein, which is present in wildrtype 
(N2) but missing in ced-4 (nl416) animals. The proteins 
phosphorylase b, 97 x 10 3 ; bovine serum albumin, 66 x 

20 10 3 (Hirayama et al., Biochem. Biophys, Res. Comm. 

173:639-646 (1990)); and ovalbumin, 43 x 10 3 , were used 
as molecular weight standards. 

To confirm the DNA sequence obtained from the 
ced-4 cDNAs and to study the structure of the ced-4 

25 gene, the sequences of the 4.4 kb cosmid subclone 

C10D8-5, the 3 kb insert pnl416, and the 2 kb Jfindlll- 
BamHI fragment that contains the Tc4 insertion in the 
ced-4 (nl416) mutant were determined. Comparison of the 
ced-4 genomic and cDNA sequences revealed that the 

30 ced-4 gene has seven introns of sizes ranging from 44 
bp to 557 bp (Figure 2). The exon sequences of genomic 
clone C10D8-5 are identical to the sequences of ced-4 
cDNA £kb. Comparison of the Tc4 insertion site in 
ced-4 (nl416) DNA with the ced-4 (+) genomic and cDNA 
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sequences indicated that Tc4 inserted into an exon in 
the ced-4 gene in ced-4 (n!416) animals (Figure 2). 

The DNA sequences of eight EMS-induced ced-4 
alleles were also determined (Table 2). One of the 
5 eight, nl948, is a missense mutation. Of the seven 

others, four create stop codons and three are predicted 
to affect splicing of the ced-4 transcript. The 
positions of these mutations are indicated in Figure 2. 
These findings indicate that the phenotypes of these 
10 mutants (Ellis and Horvitz, Cell 44:817-829 (1986)) 
result from a complete loss of ced-4 gene function. 
These mutations establish the null phenotype of the 
ced-4 gene, confirming that ced-4 function is not 
essential for viability. 

15 The Ced-4 Protein Has Two Regions Similar to Known 
Calcium-Binding Domains 

By direct inspection, the sequence of the putative 
Ced-4 protein was compared with the consensus sequence 
of the calcium-binding loop of the EF-hand domain 

20 (Tufty and Kretsinger, Science 187:161-171 (1975); 
Kretsinger, Cold Spring Harbor Symp. Quant. Biol. 
52:499-510 (1987); Szebenyi and Moffat, J. Biol. Chem. 
26:8761-8777 / (1986) ) . Two regions of the Ced-4 protein 
were identified that might bind calcium (Figure 3) . 

25 The EF-hand is a 29 amino acid domain consisting 

of a helix-loop-helix region, with the loop portion 
(residues 10-21) coordinating calcium-binding via the 
side-chain oxygens of serine, threonine, asparagine, 
aspartic acid, glutamine or glutamic acid. These 

30 residues occur at five of the vertices of an 

octahedron: X (position 10), Y (12), Z (14), -X (18), 
-Z (21) . EF-hand amino acid sequences vary 
considerably in the residues present in the calcium- 
binding loop (Figure 3), and some EF-hand domains have 
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only one helical region (Kretsinger, 1987 supra) . The 
consensus sequence is shown at the top of Figure 3. 
Positions Y, Z, and -X can have any of a number of 
amino acids which have oxygen-containing side chains. 
5 Position X is usually aspartic acid, and position -Z is 
usually glutamic acid. 

The sequences of parvalbumins from carp muscle 
(Seq. ID #3; Nockolds et al., Proc. Natl. Acad. Sci. 
USA 69:581-584 (1972)), the intestinal calcium-binding 

10 protein (ICaBP) (Seq. ID #7-8; Szebenyi et al., Nature 
294:327-332 (1981)), troponin C (Seq. ID #9-12; Collins 
et al., FEBS Lett. 36:268-272 (1973)) and calmodulin 
(Seq. ID #13; Zimmer et al., J. Biol. Ch&m. 263:19,370- 
19,383 (1988); Babu et al., Nature 315:37-40 (1985)) 

15 show canonical EF-hands. The hake and ray parvalbumins 
(Seq. ID #4-5; Capony et al. Eur. J. Biochem. 32:97-108 
(1973)); Thatcher and Pechere, Eur. J. Biochem. 75:121- 
132 (1977)), sarcoplasmic calcium-binding protein 
(SCBP) from the protochordate Amphioxus (Seq. ID #6; 

20 Takagi et al., Biochemistry 25: 3585-3592 (1986)), 
trypsinogen (Seq. ID #14; Bode and Schwager, J. Mol. 
Biol. 98:693-717 (1975)), fibrinogen (Seq. ID #15; 
Doolittle, Ann. Rev. Biochem. 53:195-229 (1984); Dang 
et al., J. Biol. Chem. 260:9713-9719 (1985)), villin 

25 (Seq. ID #16; Hesterberg and Weber, J. Biol. Chem. 

258:365-369 (1983)) and galactose-binding protein (GBP) 
(Seq. ID #17; Vyas et al., Nature 327:635-638 (1987)) 
show variations from the consensus sequence. GBP does 
not contain the helices of the EF-hand. 

30 The potential calcium-binding loops of sequence 1 

and sequence 2 are located at amino acids 77-88 and 
amino acids 292-303 of the Ced-4 protein, respectively 
(Figure 1 3). In its putative calcium-binding loop, the 
first potential EF-hand-like sequence of the Ced-4 

35 protein has four (positions Y, Z, -X, -Z) of the five 
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conserved residues with oxygen-containing side chains 
(shown in bold) , and the fifth position (X) has a 
tyrosine rather than an aspartic acid; tyrosine 
contains oxygen in its side chain. The second 
potential EF-hand-like sequence of the Ced-4 protein 
has three residues (positions Z, -X, -Z) that match the 
consensus sequence, and amino acids with oxygen- 
containing side chains at the other two positions. 
These observations suggest that these two regions of 
the Ced-4 protein might bind calcium. Like the Ced-4 
protein, a number of known calcium-binding proteins, 
such a bovine intestinal calcium-binding protein 
(ICaBP) (Szebenyi and Moffat, 1986 supra), rabbit 
troponin C (Collins et al., 1973 supra), trypsinogen 
and villin (Doolittle, 1984 supra; Danget et al., 1985 
supra) have only three or four conserved residues at 
these five positions (Figure 3). The EF-hand domains 
in ICaBP and troponin C have been shown by X-ray 
crystallography to bind calcium. 

One major difference between the Ced-4 protein and 
the calcium-binding loop of the EF-hand consensus 
sequence is at position 15. Here, the two Ced-4 
sequences have a histidine and a glutamic acid, 
respectively/ whereas most ET-hand-containing proteins 
have a glycine; this glycine has been suggested to be 
important for the turning of the loop (Kretsinger, 1987 
supra). However, a histidine is present at this 
position in a parvalbumin and an aspartic acid is 
present in another parvalbumin and also in a 
sarcoplasmic calcium-binding protein (Kretsinger, 1987 
supra) (Figure 3). Thus, the presence of histidine or 
glutamic acid at position 15 does not rule out the 
possibility that these regions bind calcium • 

The calcium-binding loop (positions 10-21) of the 
EF-hand is thought to be preceded (positions 1-9) and 
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followed by alpha-helical domains (positions 22-29) 
(Kretsinger, 1987 supra) . Since position 3 of Ced-4 
sequence 1 and positions 26 and 28 of Ced-4 sequence 2 
are prolines, these regions might not form alpha- 
5 helices • However, the known calcium-binding protein 
galactose-binding protein (GBP) has a calcium-binding 
domain similar to that of the EF-hand (Figure 3) but 
without the two helices; furthermore, position 29 of 
GBP is proline (Vyas et al., 1987 supra). Thus, the 
10 Ced-4 protein need not contain such alpha-helical 
calcium-binding domains. 

Based upon these considerations, it seems likely 
that the Ced-4 protein binds calcium or a similar 
divalent cation. 



15 EXAMPLE 2 

CLONING. SEQUENCING. AND CHARACTERIZATION OF 

THE CED-3 GENE 

MATERIALS AND METHODS 

General Methods and Strains 
'4 20 The techniques used for the culturing of C. 

elegans were as described by Brenner (Genetics 77:71-94 
(1974)). All strains were grown at 20°C. The wild- 
type parent strains were C. elegans variety Bristol 
strain N2, Bergerac strain EM1002 (Emmons et al., Cell 
25 32:55-65 (1983)), C. briggsae and C. vulgaris (obtained 
from V. Ambros) . The genetic markers used are 
described below. These markers have been described by 
Brenner (1974 supra), and Hodgkin et al. (In: The 
Nematode Caenorhabditis elegans, Wood and the Community 
30 of C. elegans Researchers (eds.), Cold Spring Harbor 
Laboratory, 1988, pp 491-584). Genetic nomenclature 
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follows the standard system (Horvitz et al., Mol. Gen. 
Genet. 175:129-133 (1979)). 

LG I: ced-l(el375) ; unc-54(r323) 
LG VI: unc-31(e928) , unc-30 (el91) , ced-3(n717, n718, 
5 nl040, nll29, n!163, nll64, nll65, nl286, 

1X1949, 112426, n2430, 1X2433), unc-26 (e205) , 
dpy-4 (ell66) 
LG V: egl-1 (n986) ; unc-76 (e911) 
LG X: dpy-3(e27) 



10 Isolation of Additional Alleles of ced-3 

A non-complementation screen was designed to 
isolate new alleles of ced-3. Because animals 
heterozygous for ced-3 (n717) in trans to a deficiency 
are viable (Ellis and Horvitz, Cell 44:817-829 (1986)), 

15 animals carrying a complete loss-of-f unction ced-3 
allele generated by mutagenesis were expected to be 
viable in trans to ced-3 (n717) , even if the new allele 
was inviable in homozygotes. Fourteen EMS mutagenized 
egl-1 males were mated with ced-3 (n 71 7) unc-26 (e205) ; 

20 egl-1 (n487) ; dpy-3(e27) hermaphrodites, egrl-2 was used 
as a marker in this screen. Dominant mutations in 
egl-1 cause tihe two hermaphrodite specific neurons, the 
HSNs, to undergo programmed cell death (Trent et al., 
Genetics 104:619-647 (1983)). The HSNs are required 

25 for normal egg-laying, and egl-l(n986) hermaphrodites, 
which lack HSNs, are egg-laying defective (Trent et 
al., 1983 supra). The mutant phenotype of egl-1 is 
suppressed in a ced-3; egl-1 strain because mutations 
in ced-3 block programmed cell deaths, egl-l males 

30 were mutagenized with EMS and crossed with ced-3 (n717) , 
unc-26 (e205) ; egl-1 (n487) ; dpy-3 (e27) . Most cross 
progeny were egg-laying defective because they were 
heterozygous for ced-3 and homozygous for egl-1. Rare 
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egg-laying competent animals were picked as candidates 
for carrying new alleles of ced-3. Four such animals 
were isolated from about 10,000 Fl cross progeny of 
EMS -mutagen i zed animals. These new mutations were made 
5 homozygous to confirm that they carried recessive 
mutations of ced-3. 

Molecular Biology 

Standard techniques of molecular biology were used 
(Maniatis et al., Molecular Cloning: A Laboratory 
10 Manual, Cold Spring Harbor Laboratory, 1983). 

Two cosmid libraries were used extensively in this 
work: a Sau3AI partial digest genomic library of 7000 
clones in the vector pHC79 and a Sau3AI partial digest 
genomic library of 6000 clones in the vector pJB8 (Ish- 
15 Horowicz and Burke, Nucleic Acids Res. 9:2989 (1981)). 

The "right" end of MMM-C1 was cloned by cutting it 
with Hindlll and self-ligating. The "left" end of 
MMM-C1 was cloned by cutting it with Bglll or Sail and 
self-ligating. 

20 The "right" end of Jc8 was made by digesting Jc8 

with EcoRI and self-ligating. The "left" end of Jc8 
was made by digesting Jc8 by Sail and self-ligating. 

C. elegans RNA was extracted using guanidine 
isothiocyanate (Kim and Horvitz, Genes & Dev. 4:357-371 

25 (1990)). Poly(A) + RNA was selected from total RNA by a 
poly(dT) column (Maniatis et al. f 1983 supra). To 
prepare stage-synchronized animals, worms were 
synchronized at different developmental stages (Meyer 
and Casson, Genetics 106:29-44 (1986)). 

30 For DNA sequencing, serial deletions were made 

according to a procedure developed by Henikoff (Gene 
28:351^359 (1984)). DNA sequences were determined 
using Sequenase and protocols obtained from US 
Biochemicals with minor modifications. 
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The Tel DNA probe for Southern blots was pCe2001, 
which contains a Bergerac Tel element (Emmons et al., 
Cell 32:55-65 (1983)). Enzymes were purchased from New 
England Biolabs, and radioactive nucleotides were from . 
5 Amersham. 

Primer extension procedures followed the protocol 
by Robert E. Kingston (In: Current Protocols in 
Molecular Biology, Ausubel et al. (eds.)# Greene 
Publishing Associates and Wiley-Interscienee, New York, 
10 p. 4.8.1) with minor modifications. 

Polymerase chain reaction (PCR) was carried out 
using standard protocols supplied by the GeneAmp Kit 
(Perkin Elmer) . The primers used for primer extension 
and PCR are as follows: 



15 Pex2: 5' TCATCGACTTTTAGATGACTAGAGAACATC 3' 

(Seq. ID #24); 
Pexl: 5' GTTGCACTGCTTTCACGATCTCCCGTCTCT 3' 

(Seq. ID #25); 
SL1: 5' GTTTAATTACCCAAGTTTGAG 3 ' (Seq. ID #26) ; 

20 SL2: 5' GGTTTTAACCAGTTACTCAAG 3' (Seq. ID #27); 

Log5: 5' CCGGTGACATTGGACACTC 3' (Seq. ID #28); and 

OligolO: 5' ACTATTCAACACTTG 3' (Seq. ID #29). 



Germline Transformation 

The procedure for microinjection basically follows 
25 that of A. Fire (EMBO J. 5:2673-2680 (1986)) with 

modifications: Cosmid DNA was twice purified by CsCl- 
gradient. Miniprep DNA was used when deleted cosmids 
were injected. To prepare miniprep DNA, DNA from 1.5 
ml overnight bacterial culture in superbroth (12 g 
Bacto-tryptone , 24 g yeast extract, 8 ml 50% glycerol, 
900 ml H 2 0, autoclaved; after autoclaving, 100 ml 0.17 
M KH 2 P0 4 and 0.72 M KH 2 P0 4 were added) was extracted by 
alkaline lysis method as described in Maniatis et al. 
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(1983 supra), DNA was treated with RNase A (37 °, 30 
minutes) and then with protease K (55° , 30 minutes), 
extracted with phenol and then chloroform, precipitated 
twice (first in 0.3 M sodium acetate and second in 0.1 
M potassium acetate, pH 7.2), and resuspended in 5 nl 
injection buffer as described by A. Fire (1986 supra) . 
The DNA concentration for injection is in the range of 
100 ug to 1 mg per ml. 

All transformation experiments used ced-l(el735) ; 
ujic-32 (e928) ced-3 (n717) strain, unc-32 was used as a 
marker for co-transformation (Kim and Horvitz, 1990 
supra) . ced-1 was present to facilitate scoring of the 
ced-3 phenotype. The mutations in ced-2 block the 
engulf ment process of cell death, which makes the 
corpses of the dead cells persist much longer than in 
wild-type animals (Hedgecock et al., Science 220:1277- 
1280 (1983)). The ced-3 phenotype was scored as the 
number of dead cells present in the head of young LI 
animals. The cosmid C10D8 or the plasmid subclones of 
C10D8 were mixed with C14G10 (unc-31(+) -containing) at 
a ratio of 2:1 or 3:1 to increase the chances that a 
Unc-31(+) transformant would contain the cosmid or 
plasmid being tested as well. Usually, 20-30 animals 
were injected in one experiment. Non-Unc Fl progeny of 
the injected animal were isolated three to four days 
later. About 1/2 to 1/3 of the non-Unc progeny 
transmitted the non-Unc phenotype to F2 progeny and 
established a transformant line. The young LI progeny 
of such non-Unc transformant were checked for the 
number of dead cells present in the head using Nomarski 
optics . 
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RESULTS 



Isolation of Additional Allele 

All of the ced-3 alleles that existed previously 
were isolated in screens designed to detect viable 
5 mutants displaying the Ced phenotype (Ellis and 

Horvitz, Cell 44:817-829 (1986)). Such screens may 
have systematically missed any class of ced-3 mutations 
that is inviable as homozygotes. For this reason, a 
scheme was designed that could isolate recessive lethal 

10 alleles of ced-3. Four new alleles of ced-3 (nll63, 
nll64, nll65, nl286) were isolated in this way. since 
new alleles were isolated at a frequency of about l in 
2500, close to the frequency expected for the 
generation of null mutations by EMS in an average C. 

15 elegans gene (Brenner, Genetics 77:71-94 (1974); 

Greenwald and Horvitz, Genetics 96:147-160 (1980)), and 
all four alleles are homozygous viable, it was - 
concluded that the null allele of ced-3 is viable. 

Mapping RFT. Ps near 

20 Tel is a C. elegans transposable element that is 

thought to be immobile in the common laboratory Bristol 
strain and in the Bergerac strain (Emmons et al., Cell 
32:55-65 (1983)). m the Bristol strain, there are 30 
copies of Tel, while in the Bergerac strain, there are 

25 more than 400 copies of Tel (Emmons et al., 1983 supra; 
Finney, Ph.D. thesis, Massachusetts institute of 
Technology, Cambridge, Massachusetts, 1987). Because 
the size of the c. elegans genome is small (haploid 
genome size 8 x 10 7 bp) (Sulston and Brenner, Genetics 

30 77:95-104 (1976)), a polymorphism due to Tel between 
the Bristol and Bergerac strains would be expected to 
occur about once every 200 kb. Restriction fragment 
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length polymorphisms (RFLPs) can be used as genetic 
markers and mapped in a manner identical to 
conventional mutant phenotypes. A general scheme has 
been designed to map Tel elements that are dimorphic 
between the Bristol and Bergerac strains near any gene 
of interest (Ruvkun et al., Genetics, 221:501-516 
(1989)). once tight linkage of a particular Tel to a 
gene of interest has been established, that Tel can be 
cloned and used to initiate chromosome walking. 

A 5.1 kb Bristol-specific Tel EcoRl fragment was 
tentatively identified as containing the Tel closest to 
ced-3. This Tel fragment was cloned using cosmids from 
a set of Tel-containing C. elegans Bristol genomic DNA 
fragments. DNA was prepared from 46 such TC1- 
containing cosmids, and this DNA was screened using 
Southern blots to identify the cosmids that contain a 
5.1 kb EcoRI Tcl-containing fragment. Two such cosmids 
were identified: MMM-C1 and MMM-C9. The 5.1 kb ScoRI 
fragment was subcloned from MMM-Cl into pUC13 
(Promega) . since both ends of Tel contain an EcoRV 
site (Rosenzweig et al., Nucleic Acids Res. 22:4201- 
4209 (1983)), EcoRV was used to remove Tel from the 5.1 
kb EcoRI fragment, generating a plasmid that contains 
only the unique flanking region of this Tcl-containing 
fragment. This plasmid was then used to map the 
specific Tel without the interference of other Tel 
elements. 

unc-30(el91) ced-3(n717) dpy-4 (ell66) /+++ males ~ 
were crossed with Bergerac (EM1002) hermaphrodites, and 
Unc non-Dpy or Dpy non-Unc recombinants were picked 
from among the F2 progeny. The recombinants were 
allowed to self -fertilize, and strains that were 
homozygous for either unc-30(el91) dpy-4 (Bergerac) or 
unc-30 (Bergerac) dpy-4 (ell66) were isolated. After 
identifying the ced genotypes of these recombinant 
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strains, DNA was prepared from these strains. A 
Southern blot of DNA from these recombinants was probed 
with the flanking sequence of the 5.1 Jcb EcoRI Tel 
fragment. This probe detects a 5.1 kb fragment in 
5 Bristol N2 and a 3.4 kb fragment in Bergerac. Five out 
of five unc-30 ced-3 dpy(+Berg) recombinants, and one 
of one unc-30 (+Berg) ced-3 dpy-4 recombinants showed 
the Bristol pattern. Nine of ten unc-30 (+Berg) dpy-4 
recombinants showed the Bergerac pattern. Only one 

10 recombinant of unc-30 (+Berg) dpy-4 resulted from a 
cross-over between ced-3 and the 5.1 kb Tel element. 
The genetic distance between ced-3 and dpy-4 is 2 map 
units (mu). Thus, this Tel element is located 0.1 mu 
on the right side of ced-3. 

15 Cosmids MMM-C1 and MMM-C9 were used to test 

whether any previously mapped genomic DNA cosmids 
overlapped with these two cosmids. A contig of 
overlapping cosmids was identified that extended the 
cloned region near ced-3 in one direction. 

20 To orient MMM-C1 with respect to this contig, both 

ends of MMM-C1 were subcloned and these subclones were 
used to probe the nearest neighboring cosmid C48D1. 
The "right" end of MMM-C1 does not hybridize to C48D1, 
while the "left" end does. Therefore, the "right" end 

25 of MMM-Cl extends further away from the contig. To 

extend this contig, the "right" end of MMM-Cl was used 
to probe the filters of two cosmid libraries (Coulson 
et al., Proc. Natl. Acad. Sci. USA 83:7821-7825 
(1986)). one clone, Jc8, was found to extend MMM-Cl in 

30 the opposite direction of the contig. 

RFLPs between the Bergerac and Bristol strains 
were used to orient the contig with respect to the 
genetic map. Bristol (N2) and Bergerac (EM1002) DNA 
was digested with various restriction enzymes and 
35. probed with different cosmids to look for RFLPs. Once 
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such an RFLP was found, DNA from recombinants of the 
Bristol and Bergerac strains between ced-3 and unc-26, 
and between unc-30 and ced-3 was used to determine the 
position of the RFLP with respect to ced-3. 

The "right" end of Jc8, which represents one end 
of the contig, detects an RFLP (nP33) when N2 and 
EM1002 DNA was digested with HindHI. A Southern blot 
of DNA from recombinants between three ced-3 (+Berg) 
unc-26 was probed with the "right" end of Jc8. Three 
of three +Berg unc-26 recombinants showed the Bristol 
pattern, while two of two ced-3 unc-26 (+Berg) 
recombinants showed the Bergerac pattern. Thus, nP33 
mapped very close or to the right side of unc-26. 

The "left" end of Jc8 also detects a HindHI RFLP 
15 (nP34) . The same Southern blot was reprobed with the 
Jc8 "left" end. Two of the two ced-3 unc-26 (+Berg) 
recombinants and two of the three ced-3 (+Berg) unc-26 
recombinants showed the Bergerac pattern. One of the 
three ced-3 (+Berg) unc-2 6 recombinants showed the 
20 Bristol pattern. The genetic distance between ced-3 
and unc-26 is 0.2 rou. Thus, nP34 was mapped between 
ced-3 and unc-26, about 0.1 mu on the right side of 
ced-3 . 

The flanking sequence of the 5.1 kb EcoRI Tel 
5 fragment (named nP35) was used to probe the same set of 
recombinants. Two of three ced-3 (+Berg) unc-26 
recombinants and two of two ced-3 unc-26 (+Berg) 
recombinants showed the Bristol pattern. Thus, nP35 
was also found to be located between ced-3 and unc-26, 
about o.l mu on the right side of ced-3. 

A similar analysis using cosmid T10H5 which 
contains the HindHI RFLP (nP36) , and cosmid B0564, 
which contains a HindHI RFLP (nP37) , showed that nP36 
and J1P37 mapped very close or to the right of unc-30. 
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These experiments localized the ced-3 gene to an 
interval of three cosmids. The positions of the RFLPs, 
and of ced-3, unc-30 and unc-26 on chromosome IV, and 
their relationships to the cosmids are shown in Figure 
9. It was has been further demonstrated by 
microinjection that cosmids C37G8 and C33F2 carry the 
unc-30 gene (John Sulston, personnel communication). 
Thus, the region containing the ced-3 gene was limited 
to an interval of two cosmids. These results are 
summarized in Figure 9. 

Complementation of ced-3 bv R e main* Tr- ans f ormat.i nn 

Cosmids that were candidates for containing the 
ced-3 gene were micro injected into a ced-3 mutant to 
see if they rescue the mutant phenotype. The procedure 
for microinjection was that of A. Fire (EMBO J. 5:2673- 
2680 (1986)) with modifications, unc-31, a mutant 
defective in locomotion, was used as a marker for • 
cotransforma'tion (Kim and Horvitz, Genes & Dev. 4:357- 
371 (1990) ) , because the phenotype of ced-3 can be 
examined only by using Nomarski optics. Cosmid C14G10 
(containing unc-31 (+)) and a candidate cosmid were 
coinjected into ced-1 (el375) ; unc-31(e928) ced-3 (n717) 
hermaphrodites, and Fl non-Unc transf ormants were 
isolated to see if the non-Unc phenotype could be 
transmitted and established as a line of transf ormants . 
Young LI progeny of such transformants were examined 
for the presence of cell deaths using Nomarski optics - 
to see whether the ced-3 phenotype was suppressed. 
Cosmid C14G10 containing unc-31 alone does not rescue 
ced-3 activity when injected into a ced-3 mutant. 
Table 4 summarizes the results of these transformation 
experiments. 

As shown in Table 4, of the three cosmids injected 
(C43C9, W07H6 and C48D1) , only C48D1 rescued the ced-3 
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phenotype (2/2 non-Unc transformants rescued the ced-3 
phenotype). One of the transformants, nEX2 , appears to 
be rescued by an extra-chromosomal array of injected 
cosmids (Way and Chalfie, Cell 54:5-16 (1988) ), which 
5 is maintained as an unstable duplication, since only 
50% of the progeny of a non-Unc Ced(+) animal are 
non-Unc Ced(+). since the non-Unc Ced (+) phenotype of 
the other transformant (nisi) is transmitted to all of 
its progeny, it is presumably an integrated 

10 transformant. Ll cecJ-2 animals contain an average of 
23 cell corpses in the head (Table 5). Ll ced-l; ced-3 
animals contain an average of 0.3 cell corpses in the 
head, ced-l; unc-31 ced-3; nisi and ced-1; unc-3l 
ced-3; nEX2 animals contain an average of 16.4 and 14.5 

15 cell corpses in the head, respectively. From these 

results, it was concluded that C48D1 contains the ced-3 
gene . 

In order to locate ced-3 more precisely within the 
cosmid C48D1', this cosmid was subcloned and the 

20 subclones were tested for the ability to rescue ced-3 
mutants (Table 5) . C48D1 DNA was digested with 
restriction enzymes that cut rarely within the cosmid 
and the remaining cosmid was self-li ga ted to generate a 
subclone. Such subclones were then injected into a 

25 ced-3 mutant to look for complementation; young Ll 
non-Unc progeny of the transformants were examined 
using Nomarski optics for the presence of cell death in 
the head. When C48D1 was digested with BamHI and self-" 
ligated, the remaining 14 kb subclone (named C48D1-28) 

30 was found to rescue the ced-3 phenotype when injected 
into a ced-3 mutant (Figure 10 and Table 5) . C48D1-28 
was then partially digested with Bgl II and self- 
ligated. Clones of various lengths were isolated and 
tested for their ability to rescue ced-3. 
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One clone, C48D1-43, which did not contain a 1.7 
kb Bglll fragment of C48D1-28, was able to rescue ced-3 
(Figure 10 and Table 5) . C48D1-43 was further 
subcloned by digesting with BamHI and Apal to isolate a 
5 10 kb BamHI-Apal fragment. This fragment was subcloned 
into pBSKII+ to generate pJ40. pj 4 0 can restore ced-3+ 
phenotype when microinjected into a ced-3 mutant. pJ40 
was subcloned by deleting a 2 kb Bglli-Apal fragment to 
generate P J107. pji07 was also able to rescue the 
10 ced-3 phenotype when microinjected into a ced-3 mutant. 
Deletion of 0.5 kb on the left side of P Ji07 could be 
made by ExoIII digestion (as in pJl07del28 and 
pJ107del34) without affecting ced-3 activity; in fact, 
one transgenic line, nEX17 , restores full ced-3 
activity. However, the ced-3 rescuing ability was 
significantly reduced when 1 kb was deleted on the left 
side of pji07 (as in pJl07dell2 and pJl07del27) , and 
the ability was completely eliminated when a 1.8 kb 
Sall-Bgl II fragment was deleted on the right side of 
PJ107 (as in P J55 and pJ56) , suggesting that this Sail 
site is likely to be in the ced-3 coding region. From 
these experiments, ced-3 was localized to a DNA 
fragment of 7.5 kb. These results are summarized in 
Figure 10 and. Table 5. 

25 ced-3 Transcript 

PJ107 was used to probe a Northern blot of N2 RNA 
and detected a band of 2.8 kb. Although this 
transcript is present in 12 ced-3 mutant animals, 
subsequent analysis showed that all 12 ced-3 mutant 
alleles contain mutations in the genomic DNA that codes 
for this mRNA (see below) , thus establishing this RNA 
as a ced-3 transcript. 

The developmental expression pattern of ced-3 was 
determined by hybridizing a Northern blot of rna from 
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animals of different stages (eggs, LI through L4 larvae 
and young adult) with the ced-3 cDNA subclone pJH8. 
Such analysis revealed that the ced-3 transcript is 
most abundant during embryonic development, which is 
the period when most programmed cell deaths occur, but 
it was also detected during the LI through L4 larval 
stages and is present in relatively high levels in 
young adults. This result suggests that ced-3 is not 
only expressed in cells undergoing programmed cell 
death . 

Since ced-3 and ced-4 are both required for 
programmed cell death in C. elegrans, one of the genes 
might act as a regulator of transcription of the other 
gene. To examine if ced-4 regulates the transcription 
of ced-3, RNA was prepared from eggs of ced-4 mutants 
(nll62, nl416, nl894, and nl920) , and a Northern blot 
was probed with the ced-3 cDNA subclone pJH8. The 
presence of RNA in each lane was confirmed with an" 
actin I probe. Such an experiment showed that the 
level of ced-3 transcript is normal in ced-4 mutants. 
This indicates that ced-4 is unlikely to be a 
transcriptional regulator of ced-3. 

Isolation of -a ced-3 cDNA 

To isolate cDNA of ced-3, pJ40 was used as a probe 
to screen a cDNA library of N2 (Kim and Horvitz, Genes 
& Dev. 4:357-371 (1990)). Seven cDNA clones were 
isolated. These cDNAs can be divided into two groups: " 
one is 3.5 kb and the other 2.5 kb. one cDNA from each 
group was subcloned and analyzed further. pJ85 
contains the 3.5 kb cDNA. Experiments showed that pJ85 
contains a ced-3 cDNA fused to an unrelated cDNA; on 
Northern blots of N2 RNA, the pJ85 insert hybridizes to 
two RNA transcripts, and on Southern blots of N2 DNA, 
pJ85 hybridizes to more than one band than pJ40 (ced-3 



ft 




-56- 

genomic DNA) does. pJ87 contains the 2.5 kb cDNA. On 
Northern blots, pJ87 hybridizes to a 2.8 kb RHA and on 
Southern blots, it hybridizes only to bands to which 
pJ40 hybridizes. Thus, pJ87 contains only ced-3 cDNA. 

To show that pJ87 does contain the ced-3 cDNA, a 
frameshift mutation was made in the Sail site of pJ40 
corresponding to the Sail site in the pJ87 cDNA. 
Constructs containing the frameshift mutation failed to 
rescue the ced-3 phenotype when microinjected into 
ced-3 mutant animals, suggesting that ced-3 activity 
has been eliminated. 



ced-3 Sequence 

The DNA sequence of pJ87 was determined (see 
Figure 4; Seq. ID #18). pJ87 contains an insert of 2.5 
kb which has an open reading frame of 503 amino acids 
(Figure 4; Seq. ID #19). The 5' end of the cDNA 
contains 25 bp of poly-A/T sequence, which is probably 
an artifact of cloning and is not present in the 
genomic sequence. The cDNA ends with a poly-A 
sequence, suggesting that it contains the complete 3' 
end of the transcript, l kb of pJ87 insert is 
untranslated 3' region and not all of it is essential 
for ced-3 expression, since genomic constructs with 
deletions of 380 bp of the 3' end can still rescue 
ced-3 mutants ( P J107 and its derivatives, see Figure 
10) . 

To confirm the DNA sequence obtained from the 
ced-3 cDNA and to study the structure of the ced-3 
gene, the genomic sequence of the ced-3 gene in the 
plasmid pJl07 was determined (Figure 4; Seq. id #18). 
Comparison of the ced-3 genomic and cDNA sequences 
revealed that the ced-3 gene has seven introns that 
range in size from 54 bp to 1195 bp (Figure 5A) . The 
four largest introns, as well as sequences 5' of the 
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start codon, were found to contain repetitive elements. 
Five types of repetitive elements were found, some of 
which have been previously characterized in non-coding 
regions of other c. elegans genes such as fem-l (Spence 
et al., Cell 60:981-990 (1990)), lin-12 (J. Yochem, 
personal communication) , and ayoD (Krause et al., Cell 
63:907-919 (1990)) (Figure 4). of these, repeat 1 was 
also found in fem-1 and oyoD, repeat 3 in lin-12 and 
fem-1, repeat 4 in lin-12, and repeats 2 and 5 were 
novel repetitive elements. 

A combination of primer extension and PCR 
amplification was used to determine the location and 
nature of the 5' end of the ced-3 transcript. Two 
primers (Pexl and Pex2) were used for the primer 
extension reaction. The Pexl reaction yielded two 
major bands, whereas the Pex2 reaction gave one band. 
The Pex2 band corresponded in size to the smaller band 
from the Pexl reaction, and agreed in length with a 
possible transcript that is trans-spliced to a c. 
elegans splice leader (Bektesh, Genes & Dev., 2:1277- 
1283 (1988)) at a consensus splice acceptor at position 
2166 of the genomic sequence (Figure 4) . The nature of 
the larger Pexl band is unclear. 

To confirm the existence of this trans-spliced 
message in wild-type worms, total c. elegans RNA was 
PCR amplified using the SLl-Log5 and SL2-Log5 primer 
pairs, followed by a reamplif ication using the SL1- 
OligolO and SL2-Oligol0 primer pairs. The SL1 reaction" 
yielded a fragment of the predicted length. The 
identity of this fragment was confirmed by sequencing. 
Thus, at least some, if not most, of the ced-3 
transcript is trans-spliced to SL1. Based on this 
result, the start codon of the ced-3 message was 
assigned to the methionine encoded at position 2232 of 
the genomic sequence (Figure 4). 
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The DNA sequences of 12 EMS-induced ced-3 alleles 
were also determined (Figure 4 and Table 3). Nine of 
the 12 are missense mutations. Two of the 12 are 
nonsense mutations, which might prematurely terminate 
the translation of ced-3. These nonsense ced-3 mutants' 
confirmed that the ced-3 gene is not essential for 
viability, one of the 12 mutations is an alteration of 
a conserved splicing acceptor G, and another has a 
change of a 70% conserved C at the splice site, which 
could also generate a stop codon even if the splicing 
is correct. Interestingly, these EMS-induced mutations 
are in either the N-terminal quarter or C-terminal half 
of the protein, in fact, 9 of the 12 mutations occur 
within the region of ced-3 that encodes the last 100 
amino acids of the protein. Mutations are notably 
absent from the middle part of the ced-3 gene (Figure 
5). 

Ced-3 Protein Contains A R e gion Rirh j n serine 

The Ced-3 protein is very hydrophilic and no 
significantly hydrophobic region can be found that 
might be a trans-membrane domain (Figure 6) . The Ced-3 
protein is rich in serine. From amino acid 78 to amino 
acid 205 of the Ced-3 protein, 34 out of 127 amino 
acids are serine. Serine is often the target of 
serine/ threonine protein kinases (Edelman, Ann. Rev. 
Biochem. 56:567-613 (1987)). For example, protein 
kinase c can phosphorylate serines when they are 
flanked on their amino and carboxyl sides by basic 
residues (Edelman, 1987 supra). Four of the serines in 
the ced-3 protein are flanked by arginines (Figure 4). 
The same serine residues might also be the target of 
related Ser/Thr kinases. 

To identify the functionally important regions of 
the Ced-3 protein, genomic DNAs containing the ced-3 
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genes from two related nematode species, c. briggsae 
and C. vulgaris were cloned and sequenced (Figure 7; 
Seq. ID #20 and 21) . Sequence comparison of the three 
ced-3 genes showed that the non-serine-rich region of 
the proteins is highly conserved. In C. briggsae and 
C. vulgaris, many amino acids in the serine-rich region 
are dissimilar compared to the C. elegans Ced-3 protein 
(Figure 7) . it seems that what is important in the 
serine-rich region is the overall serine-rich feature 
rather than the exact amino acid sequence. 

This hypothesis is also supported by analysis of 
ced-3 mutations in c. elegans: none of the 12 EMS- 
induced mutations is in the serine-rich region, 
suggesting that mutations in this region might not 
affect the function of the Ced-3 protein and thus, 
could not be isolated in the screen for ced-3 mutants. 
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Table l 
Rescue c f the Cad -4 
Phenotvps bv Qe raline Transformation 



Genotype 



DNA 
Injected 



Avg. No. cell 
Corpses (LI Bead) 



Mo. 
Animals 
Scored 



ced-1; ced-4; 
unc-31/ nExl 



C10D8 ; 
C14G10 



9.4 



10 



ced-1; ced-4; 
unc-31; nEx7 



C10D8-5 
C14G10 



11.5 



10 



ill 



m 



ced-1; ced-4 
unc-31; nEx8 

ced-1 

ced-1; ced-4 



C10D8-5 
C14G10 

None 

None 
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Table 4 

niimmni-y e f Trap 8 format ion Exp e riments 
Using G oamida in the ced-3 R«q| on 



Cosmid No. of non-Unc Ced-3 Strain nana 

injected tranaformants phenotvpa 

C43C9; C14G10 1 



W07H6; C14G10 3 



C48D1; C14G10 2 + 



MT4302 

MT4299 
MT4300 
MT4301 

MT4298 



+ MT4303 



Animals injected were of genotype: ced-1 (el735) ; unc-31 (e929) 
ced-3 (n717) . 
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Table 5 

The exp ression of ced-3 /+) transformants 



Genotype 



DNA injected 



Average Ho. 
cell deaths 
in LI head 



Ho. 

Animals 
■cored 



ced-1 



23 



20 



ced-1; ced-3 



0.3 



10 



ced-1; nisi 
unc-31 ced-3 



C48D1; 
C14G10 



16.4 



20 



ced-1; unc-31 
ced-3; nJSl/+ 



14.5 



20 



ced-1; unc-31 
ced-3; nEX2 



C48D1; 
C14G10 



13.2 

0 



10/14 
4/14 



ced-1; unc-31 
ced-3; nEXlO 



C48D1-28; 
C14G10 



12 
0 



9/10 

1 of 10 



ced-1; unc-31 
ced-3; nEX9 



C48D1-28; 
C14G10 



12 



10 



ced-1; unc-31 
ced-3; nEXll 



C48D1-43 
C14G10 



16.7 



10/13 



Abnormal cell 3/13 
deaths 



ced-1; unc-31 
ced-3; nEX13 



pJ4 0; C14G10 



13.75 



4/4 



4 



Table 5 continued 



ced-1; unc-31 
ced-3; nEX17 



ced-1; unc-31 
ced-3; nEX18 



ced-1; unc-31 
ced-3; nEX19 



ced-1; unc-31 
ced-3; nEX16 



pJ107del28, 

pJ107del34 

C14G10 



pJ107del28, 
pJ107dell34 
C14610 



pJ107del28, 

pJ107del34 

G14G10 



pJ107dell2, 

pJ107del27 

C14G10 



23 



12.8 



10.6 



7.8 



12/14 

2/14 
9/10 

1/10 
5/6 

1/6 
12/12 



t/ 1 ?^?? ° f the genes used are c *<*-l(el735) , unc-31 (e928) , and ced- 
3 (n/i/j . 



bb 



-66- 

Ecruivalents 

Those skilled in the art will recognize, or be 
able to ascertain using no more than routine 
experimentation, many equivalents to the specific 
5 embodiments of the invention described herein. Such 
equivalents are intended to be encompassed by the 
following claims. For example, functional equivalents 
of DNAs and PNAs may be nucleic acid sequences which, 
through the degeneracy of the genetic code, encode the 

10 same proteins as those specifically claimed. 

Functional equivalents of proteins may be substituted 
or modified amino acid sequences, wherein the 
substitution or modification does not change the 
activity or function of the protein. A "silent" amino 

15 acid substitution, such that a chemically similar amino 
acid (e.g., an acidic amino acid with another acidic 
amino acid) is substituted, is an example of how a 
functional equivalent of a protein can be produced. 
Functional equivalents of nucleic acids or proteins can 

20 also be produced by deletion of nonessential sequences. 
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